Abstract



We develop a space-time large-deviation point of view on Gibbs-non-Gibbs transitions in spin systems subject to a stochastic spin-flip dynamics. Using the general theory for large deviations of functionals of Markov processes outlined in Feng and Kurtz [11], we show that the trajectory under the spin-flip dynamics of the empirical measure of the spins in a large block in $\mathbb{Z}^d$ satisfies a large deviation principle in the limit as the block size tends to infinity. The associated rate function can be computed as the action functional of a Lagrangian that is the Legendre transform of a certain non-linear generator, playing a role analogous to the moment-generating function in the Gärtner-Ellis theorem of large deviation theory when this is applied to finite-dimensional Markov processes. This rate function is used to define the notion of "bad empirical measures", which are the discontinuity points of the optimal trajectories (i.e., the trajectories minimizing the rate function) given the empirical measure at the end of the trajectory. The dynamical Gibbs-non-Gibbs transitions are linked to the occurrence of bad empirical measures: for short times no bad empirical measures occur, while for intermediate and large times bad empirical measures are possible. A future research program is proposed to classify the various possible scenarios behind this crossover, which we refer to as a "nature-versus-nurture" transition.

1 Introduction, main results and research program

1.1 Dynamical Gibbs-non-Gibbs transitions 

Since the discovery of the Griffiths-Pearce-Israel pathologies in renormalization-group transformations of Gibbs measures, there has been an extensive effort towards understanding the phenomenon that a simple transformation of a Gibbs measure may give rise to a non-Gibbs measure, i.e., a measure for which no reasonable Hamiltonian can be defined (see van Enter, Fernández and Sokal [1], Fernández [12], and the papers in the EURANDOM workshop proceedings [26]). From the start, R.L. Dobrushin was interested and involved in this development; indeed, Dobrushin and Shlosman [2], [3] proposed a programme of Gibbsian restoration, based on the idea that the pathological bad configurations of a transformed Gibbs measure (i.e., the essential points of discontinuity of some of its finite-set, e.g. single-site, conditional probabilities) are exceptional in the measure-theoretic sense (i.e., they form a set of measure zero). This has led to two extended notions of Gibbs measures: weakly Gibbsian measures and almost Gibbsian measures (see Maes, Redig and Van Moffaert [21]). Later, several refined notions were proposed, such as intuitively weakly Gibbs (Van Enter and Verbitskiy [9]) and right-continuous conditional probabilities.

In Van Enter, Fernández, den Hollander and Redig [5], the behavior of a Gibbs measure $\mu$ subject to a high-temperature Glauber spin-flip dynamics was considered. A guiding example is the case where we start from the low-temperature plus-phase of the Ising model, and we run a high-temperature dynamics, modeling the fast heating up of a cold system. The question of Gibbsianness of the measure $\mu_t$ at time $t > 0$ can then be interpreted as the existence of a reasonable notion of an intermediate-time-dependent temperature (at time $t = 0$ the temperature is determined by the choice of the initial Gibbs measure, while at time $t = \infty$ the temperature is determined by the unique stationary measure of the dynamics). For infinite-temperature dynamics, the effect of the dynamics is simply that of a single-site Kadanoff transformation, with a parameter that depends on time. The extension to high-temperature dynamics was achieved with the help of a space-time cluster expansion developed in Maes and Netočný [22]. The basic picture that emerged from this work was the following:

(1) $\mu_t$ is Gibbs for small $t$;

(2) $\mu_t$ is non-Gibbs for intermediate $t$;

(3) in zero magnetic field $\mu_t$ remains non-Gibbs for large $t$, while in non-zero magnetic field $\mu_t$ becomes Gibbs again for large $t$.

Further research went into several directions and, roughly summarized, gave the following 
results: 

(a) Small-time conservation of Gibbsianness is robust: this holds for a large class of spin systems and of dynamics, including discrete spins (Le Ny and Redig [E]), continuous spins (Dereudre and Roelly ^, van Enter, Külske, Opoku and Ruszel, [15], [7], [8], [23], [6]), which can be subjected to Glauber dynamics, mixed Glauber/Kawasaki dynamics, and interacting-diffusion dynamics, not even necessarily Markovian (Redig, Roelly and Ruszel [25]), applied to a large class of initial measures (e.g. Gibbs measures for a finite-range or an exponentially decaying interaction potential).



(b) Gibbs-non-Gibbs transitions can also be defined naturally for mean-field models (see e.g. Külske and Le Ny [T3| for Curie-Weiss models subject to an independent spin-flip dynamics). In this context, much more explicit results can be obtained: transitions are sharp (i.e., in zero magnetic field there is a single time after which the measure becomes non-Gibbs and stays non-Gibbs forever, and in non-zero magnetic field there is a single time at which it becomes Gibbs again). Bad configurations can be characterized explicitly (with the interesting effect that non-neutral bad configurations can arise below a certain critical temperature). For further developments on mean-field results see also



(c) Gibbs-non-Gibbs transitions can also occur for continuous unbounded spins subject to independent Ornstein-Uhlenbeck processes (Külske and Redig [17]), and for continuous bounded spins subject to independent diffusions (Van Enter and Ruszel [7], [8]), even in two dimensions where no static phase transitions occur.

Bad configurations can be detected by looking at a so-called two-layer system: the joint distribution of the configuration at time $t = 0$ and time $t > 0$. If we condition on a particular configuration $\eta$ at time $t > 0$, then the distribution at time $t = 0$ is a Gibbs measure with an $\eta$-dependent Hamiltonian $H^\eta$, which is a random-field modification of the original Hamiltonian $H$ of the starting measure. If, for some $\eta$, $H^\eta$ has a phase transition, then this $\eta$ is a bad configuration (see Fernández and Pfister [13]).

1.2 Nature versus nurture 

While these results led to a reasonably encompassing picture, we were unsatisfied with the strategy of the proofs for the following reason. All proofs rely on two fortunate facts: (1) the evolutions can be described in terms of space-time interactions; (2) these interactions correspond to well-studied models in equilibrium statistical mechanics. In particular, although the most delicate part of the analysis - the proof of the onset of non-Gibbsianness - was accomplished by adapting arguments developed in previous studies on renormalization transformations, the actual intuition that led to these results relied on entirely different arguments, based on the behavior of conditioned trajectories. These intuitive arguments, already stated without proof in our original work [5], can be summarized as follows:

(I) If a configuration $\eta$ is good at time $t$ (i.e., is a point of continuity of the single-site conditional probabilities), then the trajectory that leads to $\eta$ is unique, in the sense that there is a single distribution at time $t = 0$ that leads to $\eta$ at time $t > 0$. In particular, if $t$ is small, then this trajectory stays close to $\eta$ during the whole time interval $[0,t]$.

(II) If a configuration $\eta$ is bad at time $t$ (i.e., is a point of essential discontinuity of the single-site conditional probabilities), then there are at least two trajectories compatible with the occurrence of $\eta$ at time $t$. Moreover, these trajectories can be selected by modifying the bad configuration $\eta$ arbitrarily far away from the origin.

(III) Trajectories ending at a configuration $\eta$ at time $t$ are the result of a competition between two mechanisms:

— Nature: The initial configuration is close to $\eta$, which is not necessarily typical for the initial measure, and is preserved by the dynamics up to time $t$.

— Nurture: The initial configuration is typical for the initial measure and the system builds $\eta$ in a short interval prior to time $t$.

As an illustration, let us consider the low-temperature zero-field Ising model subject to an independent spin-flip dynamics. In [5] we proved that the fully alternating configuration becomes and stays bad for large $t$. This fact can be understood according to the preceding paradigm in the following way. Short times do not give the system occasion to perform a large number of spin-flips. Hence, the most probable way to see the alternating configuration at small time $t$ is when the system started in a zero-magnetization-like state and the evolution kept the magnetization zero up to time $t$. This is the nature-scenario! For larger times $t$, a less costly alternative is to start in a less atypical manner, and to arrive at the alternating configuration following a trajectory that stays close for as long as possible to the unconditioned dynamical relaxation. This is the nurture-scenario! In this situation, we can start either from a plus-like state or a minus-like state, as the difference in probabilistic cost between these two initial states is exponential in the size of the boundary, and thus is negligible with respect to the volume cost imposed by a constrained dynamics. It is then possible to select between the plus-like and the minus-like trajectories by picking the alternating configuration in a large block, then picking either the all-plus or the all-minus configuration outside this block, and letting the block size tend to infinity.

We see that the above explanation relies on two facts: 
(i) The existence of a nature-versus-nurture transition, as introduced in [5]. 

(ii) The existence of several possible trajectories (once the system is in the nurture regime), all starting from configurations that are typical for the initial measure (modulo a boundary-exponential cost). These trajectories evolve to the required bad configuration over a short interval prior to time $t$.

1.3 Large deviations of trajectories 

The goal of the present paper is to put rigor into the above qualitative suggestions. We 
propose two novel aspects: 

(1) the development of a suitable large deviation theory for trajectories, in order to estimate 
the costs of the different dynamical strategies; 

(2) the use of empirical measures instead of configurations, in order to express the conditioning at time $t$.

For a translation-invariant spin-flip dynamics and a translation-invariant initial measure, nothing is lost by moving to the empirical measure because the bad configurations form a translation-invariant set. Instead, a lot is gained because, as we will show, the trajectory of the empirical measure satisfies a large deviation principle under quite general conditions on the spin-flip rates (e.g. there is no restriction to high temperature). Moreover, the question of uniqueness versus non-uniqueness of optimal trajectories (i.e., minimizers of the large deviation rate function) can be posed and tackled for a large class of dynamics, which places the dynamical Gibbs-non-Gibbs transition into a framework where it gains more physical relevance.

Here is a list of the results presented in the sequel. 



(A) Existence of a large deviation principle for trajectories. We apply the theory developed in Feng and Kurtz [11], Section 8.6. The rate function is the integral of the Legendre transform of the generator of the non-linear semigroup defined by the dynamics. In suitably abstract terms, this generator can be associated to a Hamiltonian, and the rate function to the integral of a Lagrangian (Sections 2-5).

(B) Explicit expression for the generator of the non-linear semigroup of the dynamics. This is obtained in Theorems 3.1-3.2 below (Section 3).

(C) Rate functions for trajectories and associated optimal trajectories. The general Legendre-transform prescription is explicitly worked out for a couple of simple examples, and optimal trajectories are exhibited (Sections 4.2-4.3).

(D) Relation with thermodynamic potentials. Relations are shown between the non-linear generator and the derivative of a "constrained pressure". Similarly, the rate function per unit time is related to the Legendre transform of this pressure (Section 5.2).

(E) Definition of bad measures. This definition, introduced in Section 6, is the transcription to our more general framework of the notion of bad configuration used in our original work [5]. In Section 7 we discuss the possible relations between these two notions of badness.



1.4 Future research program 

The results in (A)-(E) above are the preliminary steps towards a comprehensive theory of dynamical Gibbs-non-Gibbs transitions based on the principles outlined above. Let us conclude this introduction with a list of further issues which must be addressed to develop such a theory:

• Definition of "nature-trajectories" and "nurture-trajectories". This is a delicate issue
that requires full exploitation of the properties of the rate function for the trajectory. 
It must involve a suitable notion of distance between conditioned and unconditioned 
trajectories. 

• Relation between nature-trajectories and Gibbsianness. It is intuitively clear that
Gibbsianness is conserved for times so short that only nature-trajectories are possible. 
A rigorous proof of this fact would confirm our intuition and would lead to alternative 
and less technical proofs of short-time Gibbsianness preservation. 

• Study of nurture-trajectories. We expect that nurture-trajectories start very close to 
unconstrained trajectories, and move away only shortly before the end in order to satisfy 
the conditioning. For the case of time-reversible evolutions, the time it takes to get to the 
nurture-regime should be the same as the initial relaxation time to (almost) equilibrium. 

• Study of nature-nurture transitions. Transitions from nature to nurture should happen 
only once for every conditioning measure (i.e., there should be no nature-restoration). 
Natural questions are: Does the time at which these transitions take place depend on 
the conditioning measure? Is there a common time after which every trajectory becomes 
nurture? 



• Case studies of trajectories leading to non-Gibbsianness. These should determine "forbidden
regions" in trajectory space. Natural questions are: How do these regions evolve?
Are they monotone in time? 

• Relation between nurture-trajectories and non-Gibbsianness. While we expect that "all 
trajectories are nature" implies Gibbsianness of the evolved measure, we do not expect 
that "some trajectories are nurture" leads to non-Gibbsianness. Examples are needed 
to clarify this asymmetry. The case of the Ising model in non-zero field - in which 
Gibbsianness is eventually restored - should be particularly enlightening. 



1.5 Outline 

Our paper is organized as follows. In Section 2 we consider the case of independent spin-flips, as a warm-up for the rest of the paper. In Section 3 we compute the non-linear generator for dependent spin-flips, which plays a key role in the large deviation principle we are after. In Sections 4 and 5 we compute the Legendre transform of this non-linear generator, which is the object that enters into the associated rate function, as an action integral. In Section 4 we do the computation for independent spin-flips, in Section 5 we extend the computation to dependent spin-flips. In Section 6 we look at bad measures, i.e., measures at time $t > 0$ for which the optimal trajectory leading to this measure and minimizing the rate function is non-unique. In Section 7 we use these results to develop our large-deviation view on Gibbs-non-Gibbs transitions. In Appendix A we illustrate the large deviation formalism in Feng and Kurtz [11], which lies at the basis of Sections 2-5, by considering a simple example, namely, a Poisson random walk with small increments. This will help the reader not familiar with this formalism to grasp the main ideas.



2 Independent spin-flips: trajectory of the magnetization 

2.1 Large deviation principle 

As a warm-up, we consider the example of Ising spins on the one-dimensional torus $\mathbb{T}_N = \{1,\dots,N\}$ subject to a rate-1 independent spin-flip dynamics. Write $\mathbb{P}_N$ to denote the law of this process. We look at the trajectory of the magnetization, i.e., $t \mapsto m_N(t) = N^{-1}\sum_{i=1}^{N} \sigma_i(t)$, where $\sigma_i(t)$ is the spin at site $i$ at time $t$. A spin-flip from $+1$ to $-1$ (or from $-1$ to $+1$) corresponds to a jump of size $-2N^{-1}$ (or $+2N^{-1}$) in the magnetization, i.e., the generator $L_N$ of the process $(m_N(t))_{t \geq 0}$ is given by

$$ (L_N f)(m) = \frac{1+m}{2}\,N\,\big[f(m - 2N^{-1}) - f(m)\big] + \frac{1-m}{2}\,N\,\big[f(m + 2N^{-1}) - f(m)\big] \tag{2.1} $$

for $m \in \{-1, -1+2N^{-1}, \dots, 1-2N^{-1}, 1\}$. If $\lim_{N\to\infty} m_N = m$ and $f$ is $C^1$ with bounded derivative, then

$$ \lim_{N\to\infty} (L_N f)(m_N) = (Lf)(m) \quad \text{with} \quad (Lf)(m) = -2m f'(m). \tag{2.2} $$

This is the generator of the deterministic process $m(t) = m(0)e^{-2t}$, solving the equation $\dot m(t) = -2m(t)$ (the dot denotes the derivative with respect to time).
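The convergence in (2.2) can be checked numerically. The following is a minimal sketch, not part of the original argument: the test function $f = \sin$ and the point $m = 0.4$ are arbitrary choices made only for illustration. A second-order Taylor expansion of (2.1) suggests that the error should decay like $1/N$.

```python
import math

def generator_LN(f, m, N):
    """Apply the finite-N generator (2.1) to f at magnetization m."""
    down = 0.5 * (1.0 + m) * N * (f(m - 2.0 / N) - f(m))  # a +1 spin flips to -1
    up = 0.5 * (1.0 - m) * N * (f(m + 2.0 / N) - f(m))    # a -1 spin flips to +1
    return down + up

def generator_limit(f_prime, m):
    """Limiting generator (2.2): (Lf)(m) = -2 m f'(m)."""
    return -2.0 * m * f_prime(m)

# Illustrative choices (assumptions, not from the paper): f = sin, m = 0.4.
f, f_prime, m = math.sin, math.cos, 0.4
errors = [abs(generator_LN(f, m, N) - generator_limit(f_prime, m))
          for N in (10, 100, 1000, 10000)]
for N, err in zip((10, 100, 1000, 10000), errors):
    print(f"N = {N:6d}   |L_N f - L f| = {err:.3e}")
```
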



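The trajectory of the magnetization satisfies a large deviation principle, i.e., for every time horizon $T \in (0,\infty)$ and trajectory $\gamma = (\gamma_t)_{t\in[0,T]}$,

$$ \mathbb{P}_N\Big( (m_N(t))_{t\in[0,T]} \approx (\gamma_t)_{t\in[0,T]} \Big) \approx \exp\Big[ -N \int_0^T L(\gamma_t,\dot\gamma_t)\,dt \Big], \tag{2.3} $$

where the Lagrangian $t \mapsto L(\gamma_t,\dot\gamma_t)$ can be computed following the scheme of Feng and Kurtz [11], Example 1.5. Indeed, we first compute the so-called non-linear generator $H$ given by

$$ (Hf)(m) = \lim_{N\to\infty} (H_N f)(m_N) \quad \text{with} \quad (H_N f)(m_N) = \frac{1}{N}\, e^{-N f(m_N)}\, L_N\big(e^{N f}\big)(m_N), \tag{2.4} $$

where $\lim_{N\to\infty} m_N = m$. This gives

$$ (Hf)(m) = \frac{1+m}{2}\,\big(e^{-2f'(m)} - 1\big) + \frac{1-m}{2}\,\big(e^{2f'(m)} - 1\big), \tag{2.5} $$

which is of the form

$$ (Hf)(m) = H(m, f'(m)) \tag{2.6} $$

with

$$ H(m,p) = \frac{1+m}{2}\,\big(e^{-2p} - 1\big) + \frac{1-m}{2}\,\big(e^{2p} - 1\big). \tag{2.7} $$

Because $p \mapsto H(m,p)$ is convex, we have

$$ H(m,p) = \sup_{q} \big[pq - L(m,q)\big] \tag{2.8} $$

with

$$ L(m,q) = \sup_{p} \big[pq - H(m,p)\big] = \frac{q}{2}\,\log\Big( \frac{q + \sqrt{q^2 + 4(1-m^2)}}{2(1-m)} \Big) - \frac{1}{2}\sqrt{q^2 + 4(1-m^2)} + 1. \tag{2.9} $$

Hence, using the theory developed in Feng and Kurtz [11], Chapter 1, Example 1.5, we indeed have the large deviation principle in (2.3) with $L(\gamma_t,\dot\gamma_t)$ given by (2.9) with $m = \gamma_t$ and $q = \dot\gamma_t$.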
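As a sanity check on the Legendre transform, the closed form for $L(m,q)$ obtained by solving $\partial H/\partial p = q$ for the Hamiltonian (2.7) can be compared with a direct numerical maximization of $pq - H(m,p)$. The sketch below is an illustration only: the ternary search, its bracket $[-10,10]$ and the sample points are assumptions chosen for convenience. It also checks that $L(m,-2m) = 0$, i.e., that the unconditioned relaxation $\dot m = -2m$ has zero cost.

```python
import math

def hamiltonian(m, p):
    """H(m, p) from (2.7)."""
    return (0.5 * (1 + m) * (math.exp(-2 * p) - 1)
            + 0.5 * (1 - m) * (math.exp(2 * p) - 1))

def lagrangian(m, q):
    """Closed form of sup_p [p q - H(m, p)], valid for -1 < m < 1."""
    root = math.sqrt(q * q + 4 * (1 - m * m))
    return 0.5 * q * math.log((q + root) / (2 * (1 - m))) - 0.5 * root + 1

def lagrangian_numeric(m, q, lo=-10.0, hi=10.0, iters=200):
    """sup_p [p q - H(m, p)] by ternary search (the objective is concave in p)."""
    def objective(p):
        return p * q - hamiltonian(m, p)
    for _ in range(iters):
        a = lo + (hi - lo) / 3
        b = hi - (hi - lo) / 3
        if objective(a) < objective(b):
            lo = a
        else:
            hi = b
    return objective(0.5 * (lo + hi))

for m, q in [(0.0, 1.0), (0.3, -1.5), (-0.5, 0.2)]:
    print(m, q, lagrangian(m, q), lagrangian_numeric(m, q))
```
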

2.2 Optimal trajectories 

We may think of the typical trajectories $(m_N(t))_{t\in[0,T]}$ as being exponentially close to optimal trajectories minimizing the action functional $\gamma = (\gamma_t)_{t\in[0,T]} \mapsto \int_0^T L(\gamma_t,\dot\gamma_t)\,dt$. The optimal trajectories satisfy the Euler-Lagrange equations

$$ \frac{d}{dt}\,\frac{\partial L}{\partial \dot\gamma_t} = \frac{\partial L}{\partial \gamma_t} \tag{2.10} $$

or, equivalently, the Hamilton-Jacobi equations corresponding to the Hamiltonian in (2.7),

$$ \dot m = \frac{\partial H}{\partial p}, \qquad \dot p = -\frac{\partial H}{\partial m}, \tag{2.11} $$

which gives

$$ \dot m = -m\,(e^{2p} + e^{-2p}) + (e^{2p} - e^{-2p}), \qquad \dot p = \tfrac{1}{2}\,(e^{2p} - e^{-2p}). \tag{2.12} $$

Putting $h = \tanh(p)$ and integrating the second equation in (2.12), we obtain

$$ h(t) = C e^{2t}. \tag{2.13} $$

Using that $\operatorname{arctanh}(x) = \tfrac{1}{2}\log\big(\tfrac{1+x}{1-x}\big)$, we get

$$ \dot m = -m\,\frac{2(1+h^2)}{1-h^2} + \frac{4h}{1-h^2}, \tag{2.14} $$

which can be integrated to yield the solution

$$ m(t) = C_1 e^{2t} + C_2 e^{-2t}, \tag{2.15} $$

where the constants $C_1, C_2$ are determined by the initial magnetization and the corresponding initial momentum. One example of an optimal trajectory corresponds to the dynamics starting from an initial magnetization $m_0$, giving $m(t) = m_0 e^{-2t}$, i.e., $C_1 = 0$ and $C_2 = m_0$. Another example of an optimal trajectory is the reversed dynamics arriving at magnetization $m_T$ at time $T$, giving $m(t) = m_T e^{2(t-T)}$, i.e., $C_2 = 0$ and $C_1 = m_T e^{-2T}$.
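The closed-form solution can be compared with a direct numerical integration of the Hamilton equations in (2.12). The sketch below uses a classical fourth-order Runge-Kutta scheme. The relation between the constants and the initial data, $C = \tanh(p(0))$, $C_2 = (m(0) - C)/(1 - C^2)$, $C_1 = m(0) - C_2$, is not stated in the text; it is obtained here by inserting (2.13) and (2.15) into (2.14) and should be read as an assumption of this illustration, as should the chosen initial values.

```python
import math

def hamilton_rhs(m, p):
    """Right-hand side of (2.12): returns (dm/dt, dp/dt)."""
    e_plus, e_minus = math.exp(2 * p), math.exp(-2 * p)
    return -m * (e_plus + e_minus) + (e_plus - e_minus), 0.5 * (e_plus - e_minus)

def integrate(m0, p0, T, steps=2000):
    """Classical fourth-order Runge-Kutta integration of (m, p) on [0, T]."""
    dt = T / steps
    m, p = m0, p0
    for _ in range(steps):
        k1 = hamilton_rhs(m, p)
        k2 = hamilton_rhs(m + 0.5 * dt * k1[0], p + 0.5 * dt * k1[1])
        k3 = hamilton_rhs(m + 0.5 * dt * k2[0], p + 0.5 * dt * k2[1])
        k4 = hamilton_rhs(m + dt * k3[0], p + dt * k3[1])
        m += dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6
        p += dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6
    return m, p

def closed_form(m0, p0, t):
    """m(t) = C1 e^{2t} + C2 e^{-2t} and p(t) = arctanh(C e^{2t}), with the
    constants fixed by the initial data (derived for this sketch)."""
    C = math.tanh(p0)
    C2 = (m0 - C) / (1 - C * C)
    C1 = m0 - C2
    return C1 * math.exp(2 * t) + C2 * math.exp(-2 * t), math.atanh(C * math.exp(2 * t))

m_num, p_num = integrate(0.3, 0.1, 0.5)
m_exact, p_exact = closed_form(0.3, 0.1, 0.5)
print(m_num, m_exact, p_num, p_exact)
```

With zero initial momentum the sketch reduces to the unconditioned relaxation $m(t) = m_0 e^{-2t}$ mentioned above.
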

Yet another example is the following. Suppose that we start the independent spin-flip dynamics from a measure under which the magnetization satisfies a large deviation principle with rate function, say, $I$, e.g. a Gibbs measure. If we want to arrive at a given magnetization $m_T$ at time $T$, then the optimal trajectory is given by (2.15) with end condition $m(T) = m_T$ and satisfying the open-end condition relating the Lagrangian $L$ at time $t = 0$ to the rate function $I$ at magnetization $m = \gamma_0$ as follows:

$$ \frac{\partial L(\gamma_t,\dot\gamma_t)}{\partial \dot\gamma_t}\bigg|_{t=0} = \frac{\partial I(m)}{\partial m}\bigg|_{m=\gamma_0}. \tag{2.16} $$

This condition is obtained by minimizing $\gamma \mapsto I(\gamma_0) + \int_0^T L(\gamma_t,\dot\gamma_t)\,dt$ (see Ermolaev and Külske [10]).



3 Trajectory of the empirical measure for dependent spin-flips 

We will generalize the computation in Section 2 in two directions. First, for independent spin-flips we are confronted with the problem that the rate at which the average of a local observable changes in general depends on the average of other observables. Second, for dependent spin-flips even the trajectory of the magnetization is not Markovian. Therefore, we are obliged to consider the time evolution of all spatial averages jointly, i.e., the empirical measure.

3.1 Setting and notation 

For $N \in \mathbb{N}$, let $\mathbb{T}_N^d$ be the $d$-dimensional $N$-torus $(\mathbb{Z}/(2N+1)\mathbb{Z})^d$. For $i,j \in \mathbb{T}_N^d$, let $i+j$ denote coordinate-wise addition modulo $2N+1$. We consider Glauber dynamics of Ising spins located at the sites of $\mathbb{T}_N^d$, i.e., on the configuration space $\Omega_N = \{-1,1\}^{\mathbb{T}_N^d}$. We write $\Omega = \{-1,1\}^{\mathbb{Z}^d}$ to denote the infinite-volume configuration space. Configurations are denoted by symbols like $\sigma$ and $\eta$. For $\sigma \in \Omega_N$, $\sigma_i$ denotes the value of the spin at site $i$. We write $\mathcal{M}_1(\Omega)$ to denote the set of probability measures on $\Omega$, and similarly for $\mathcal{M}_1(\Omega_N)$.

The dynamics is defined via the generator $L_N$ acting on functions $f\colon \Omega_N \to \mathbb{R}$ as

$$ (L_N f)(\sigma) = \sum_{i\in\mathbb{T}_N^d} c_i(\sigma)\,\big[f(\sigma^i) - f(\sigma)\big], \tag{3.1} $$

where $\sigma^i$ denotes the configuration obtained from $\sigma$ by flipping the spin at site $i$. The rates $c_i(\sigma)$ are assumed to be strictly positive and translation invariant, i.e.,

$$ c_i(\sigma) = c_0(\tau_i\sigma) = c(\tau_i\sigma) \quad \text{with} \quad (\tau_i\sigma)_j = \sigma_{i+j}. \tag{3.2} $$

We think of the dynamics with generator $L_N$ as a finite-volume version with periodic boundary condition of the infinite-volume generator

$$ (Lf)(\sigma) = \sum_{i\in\mathbb{Z}^d} c_i(\sigma)\,\big[f(\sigma^i) - f(\sigma)\big], \tag{3.3} $$

where now $f$ is supposed to be a local function, i.e., a function depending on a finite number of $\sigma_j$, $j \in \mathbb{Z}^d$. We denote by $(S_t)_{t\geq 0}$ with $S_t = e^{tL}$ the semigroup acting on $C(\Omega)$ (the space of continuous functions on $\Omega$) associated with the generator in (3.3), and similarly $(S_t^N)_{t\geq 0}$ with $S_t^N = e^{tL_N}$. For $\mu \in \mathcal{M}_1(\Omega)$, we denote by $\mu S_t \in \mathcal{M}_1(\Omega)$ the distribution $\mu$ evolved over time $t$, and similarly for $\mu_N S_t^N$ with $\mu_N \in \mathcal{M}_1(\Omega_N)$.

We embed $\mathbb{T}_N^d$ in $\mathbb{Z}^d$ by identifying it with $\Lambda_N = ([-N,N] \cap \mathbb{Z})^d$. Through this identification, we give meaning to expressions like $\sum_{i\in\mathbb{T}_N^d} f(\tau_i\sigma)$ for $\sigma \in \Omega$ and $f\colon \Omega \to \mathbb{R}$. In this way we may also view local functions $f\colon \Omega \to \mathbb{R}$ as functions on $\Omega_N$ as soon as $N$ is large enough for $\Lambda_N$ to contain the dependence set of $f$. For a translation-invariant $\mu \in \mathcal{M}_1(\Omega)$, we denote by $\mu_N$ its natural restriction to $\Omega_N$.

By the locality of the spin-flip rates, the infinite-volume dynamics is well-defined and is the uniform limit of the finite-volume dynamics, i.e., for every local function $f\colon \Omega \to \mathbb{R}$ and $t > 0$,

$$ \lim_{N\to\infty} \| S_t^N f - S_t f \|_\infty = 0. \tag{3.4} $$

See Liggett [20], Chapters 1 and 3, for details on existence of the infinite-volume dynamics.

3.2 Empirical measure 

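For $N \in \mathbb{N}$ and $\sigma \in \Omega_N$, the empirical measure associated with $\sigma$ is defined as

$$ \mathcal{L}_N(\sigma) = \frac{1}{|\mathbb{T}_N^d|} \sum_{i\in\mathbb{T}_N^d} \delta_{\tau_i\sigma}. \tag{3.5} $$

This is an element of $\mathcal{M}_1(\Omega_N)$ which acts on functions $f\colon \Omega_N \to \mathbb{R}$ as

$$ \langle f, \mathcal{L}_N(\sigma) \rangle = \int f \, d\mathcal{L}_N(\sigma) = \frac{1}{|\mathbb{T}_N^d|} \sum_{i\in\mathbb{T}_N^d} f(\tau_i\sigma). \tag{3.6} $$

As already mentioned above, a local $f\colon \Omega \to \mathbb{R}$ may be considered as a function on $\Omega_N$ for $N$ large enough. A sequence $(\mu_N)_{N\in\mathbb{N}}$ with $\mu_N \in \mathcal{M}_1(\Omega_N)$ converges weakly to some $\mu \in \mathcal{M}_1(\Omega)$ if

$$ \lim_{N\to\infty} \int f \, d\mu_N = \int f \, d\mu \qquad \forall\, f \text{ local}. \tag{3.7} $$

For $\sigma \in \Omega$, we define its periodized version $\sigma^N$ as $\sigma^N_i = \sigma_i$ for $i = (i_1,\dots,i_d)$ with $-N \leq i_k \leq N$ for $k \in \{1,\dots,d\}$, and $\sigma^N_i = \sigma_{i \bmod (2N+1)}$ otherwise.

If $\mu$ is ergodic under translations, then by the locality and the translation invariance of the spin-flip rates also $\mu S_t$ is ergodic under translations. Let $\mu^N$ be the distribution of $\sigma^N$ when $\sigma$ is drawn from $\mu$. Since the semigroup $(S_t^N)_{t\geq 0}$ uniformly approaches the semigroup $(S_t)_{t\geq 0}$ as $N \to \infty$, the ergodic theorem implies that

$$ \mathcal{L}_N(\sigma^N(t)) \to \mu S_t \quad \text{weakly as } N \to \infty, \tag{3.8} $$

where $\sigma^N(t)$ denotes the random configuration that is obtained by evolving $\sigma^N$ over time $t$ in the process with generator $L_N$.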
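The law of large numbers in (3.8) can be illustrated in the simplest setting. The sketch below makes several assumptions that are not part of the general statement: dimension $d = 1$, rates $c \equiv 1$ (independent spin-flips), a Bernoulli product initial measure, and the single observable $f_0(\sigma) = \sigma_0$, whose empirical average is the magnetization. Under rate-1 flips a spin differs from its initial value at time $t$ with probability $(1 - e^{-2t})/2$, so the magnetization should be close to its initial value times $e^{-2t}$.

```python
import math
import random

def evolved_magnetization(sigma, t, rng):
    """Magnetization at time t under independent rate-1 spin flips.

    Each spin flips at the events of a rate-1 Poisson process, so at time t it
    differs from its initial value with probability (1 - exp(-2t)) / 2.
    """
    flip_prob = 0.5 * (1.0 - math.exp(-2.0 * t))
    total = sum(-s if rng.random() < flip_prob else s for s in sigma)
    return total / len(sigma)

rng = random.Random(0)                 # fixed seed, for reproducibility only
n_sites = 20001                        # |T_N| = 2N + 1 with N = 10000, d = 1
m0 = 0.6                               # illustrative initial mean magnetization
sigma = [1 if rng.random() < 0.5 * (1 + m0) else -1 for _ in range(n_sites)]

t = 0.7
initial = sum(sigma) / n_sites
empirical = evolved_magnetization(sigma, t, rng)
predicted = initial * math.exp(-2.0 * t)
print(f"empirical = {empirical:.4f}, predicted = {predicted:.4f}")
```

Deviations of the empirical value from the prediction are of order $|\mathbb{T}_N|^{-1/2}$; the large deviation principle discussed next quantifies the exponentially small probability of deviations of order one.
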

The deterministic trajectory $t \mapsto \mu_t = \mu S_t$ is the solution of the equation

$$ \frac{d\mu_t}{dt} = L^* \mu_t, \tag{3.9} $$

where $L^*$ denotes the adjoint of the generator acting on the space of finite signed measures $\mathcal{M}(\Omega)$. Thus, we can view the convergence in (3.8) as an infinite-dimensional law of large numbers, where the random measure-valued trajectory $(\mathcal{L}_N(\sigma^N(t)))_{t\in[0,T]}$ converges to the deterministic measure-valued trajectory $(\mu S_t)_{t\in[0,T]}$. It is therefore natural to ask for an associated large deviation principle, i.e., does there exist a rate function $\gamma \mapsto I(\gamma)$ such that

$$ \mathbb{P}_N\Big( (\mathcal{L}_N(\sigma^N(t)))_{t\in[0,T]} \approx \gamma \Big) \approx \exp\big[ -|\mathbb{T}_N^d|\, I(\gamma) \big]\,? \tag{3.10} $$

Inspired by the example of the magnetization described in Section 2, we expect the answer to be yes and the rate function to be of the form

$$ I(\gamma) = \int_0^T L(\gamma_t,\dot\gamma_t)\,dt \tag{3.11} $$

for some appropriate Lagrangian $L$. In order to compute $L$, we must first find the generator of the non-linear semigroup.

3.3 The generator of the non-linear semigroup

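In our setting the non-linear generator is defined as follows:

$$ (H_N F)(\mathcal{L}_N(\sigma)) = \frac{1}{|\mathbb{T}_N^d|}\, e^{-|\mathbb{T}_N^d|\, F(\mathcal{L}_N(\sigma))}\, L_N\big( e^{|\mathbb{T}_N^d|\, F\circ\mathcal{L}_N} \big)(\sigma). \tag{3.12} $$

If the expression in (3.12) has a limit $(HF)(\mu)$ as $N \to \infty$ when $\mathcal{L}_N(\sigma) \to \mu$ weakly, then a candidate rate function can be constructed via Legendre transformation (see Section [S]).

To compute the limit operator, we start with a simple function of the form

$$ F(\mathcal{L}_N(\sigma)) = \langle f, \mathcal{L}_N(\sigma) \rangle, \tag{3.13} $$

where $f\colon \Omega \to \mathbb{R}$ is a local function. Such $f$'s are linear combinations of the functions

$$ H_A(\sigma) = \prod_{i\in A} \sigma_i, \qquad A \subset \mathbb{Z}^d \text{ finite}, \tag{3.14} $$

which live on $\Omega_N$ for $N$ large enough.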




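Theorem 3.1. For all local $f\colon \Omega \to \mathbb{R}$ and $N$ large enough,

$$ \frac{1}{|\mathbb{T}_N^d|}\, e^{-|\mathbb{T}_N^d|\,\langle f, \mathcal{L}_N(\sigma)\rangle}\, L_N\big( e^{|\mathbb{T}_N^d|\,\langle f, \mathcal{L}_N(\cdot)\rangle} \big)(\sigma) = \big\langle c\,\big(e^{\mathcal{D}_N f} - 1\big), \mathcal{L}_N(\sigma) \big\rangle, \tag{3.15} $$

where $\mathcal{D}_N$ is the linear operator, acting on functions on $\Omega_N$, defined via

$$ \mathcal{D}_N 1 = 0, \qquad \mathcal{D}_N H_A = \sum_{r\in -A} (-2)\, H_{A+r} \quad \text{for } A \subset \mathbb{T}_N^d, \tag{3.16} $$

where the $N$-dependence refers only to the fact that the addition $A + r$ is modulo $2N+1$.

Proof. Using the definition of the generator $L_N$ in (3.1), we write (recall that $c_k(\sigma) = c(\tau_k\sigma)$ by (3.2)) the left-hand side of (3.15) as

$$ \frac{1}{|\mathbb{T}_N^d|} \sum_{k\in\mathbb{T}_N^d} c(\tau_k\sigma) \Big( \exp\Big[ \sum_{i\in\mathbb{T}_N^d} \big[ f(\tau_i(\sigma^k)) - f(\tau_i(\sigma)) \big] \Big] - 1 \Big). \tag{3.17} $$

Since

$$ (\mathcal{D}_N^k f)(\sigma) = \sum_{i\in\mathbb{T}_N^d} \big[ f(\tau_i(\sigma^k)) - f(\tau_i(\sigma)) \big] \tag{3.18} $$

is a linear operator, it suffices to prove that

$$ (\mathcal{D}_N^k f)(\sigma) = (\mathcal{D}_N f)(\tau_k\sigma) \quad \text{for } f = H_A, \tag{3.19} $$

where $\mathcal{D}_N f$ is given by (3.16) for $f = H_A$ (note that if $f = H_A$, then $f(\sigma^k) = -H_A(\sigma)$ for $k \in A$ and $f(\sigma^k) = f(\sigma)$ otherwise). Indeed,

$$ \begin{aligned} (\mathcal{D}_N^k H_A)(\sigma) &= \sum_{i\in\mathbb{T}_N^d} \Big( \prod_{j\in A} (\sigma^k)_{i+j} - \prod_{j\in A} \sigma_{i+j} \Big) = \sum_{i\in\mathbb{T}_N^d} 1_{\{k-i\in A\}}\, (-2) \prod_{j\in A} \sigma_{i+j} \\ &= \sum_{i\in\mathbb{T}_N^d} 1_{\{i\in -A+k\}}\, (-2) \prod_{j\in A} \sigma_{i+j} = (-2) \sum_{r\in -A} \prod_{i\in A} \sigma_{i+r+k} \\ &= \Big( (-2) \sum_{r\in -A} H_{A+r} \Big)(\tau_k\sigma). \end{aligned} \tag{3.20} $$

□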
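The identity (3.15) of Theorem 3.1 can be verified by brute force on a small torus. The sketch below is an illustration under stated assumptions: dimension $d = 1$ with $N = 2$ (five sites), a hypothetical nearest-neighbour rate function `rate`, and an arbitrary test function $f$ given by its coefficients in the basis $H_A$. Neither the rate nor the coefficients come from the text; any strictly positive local rate should work.

```python
import itertools
import math

SIZE = 5  # one-dimensional torus Z/(2N+1)Z with N = 2

def shift(sigma, k):
    """(tau_k sigma)_j = sigma_{k+j}, indices modulo SIZE."""
    return tuple(sigma[(k + j) % SIZE] for j in range(SIZE))

def flip(sigma, k):
    """sigma^k: the configuration with the spin at site k flipped."""
    return tuple(-s if i == k else s for i, s in enumerate(sigma))

def evaluate(f, sigma):
    """f = {A: a_A} stands for sum_A a_A H_A with H_A(sigma) = prod_{i in A} sigma_i."""
    return sum(a * math.prod(sigma[i] for i in A) for A, a in f.items())

def apply_D(f):
    """D_N from (3.16): D_N H_A = sum_{r in -A} (-2) H_{A+r}."""
    out = {}
    for A, a in f.items():
        for i in A:  # r = -i runs over -A
            B = frozenset((j - i) % SIZE for j in A)
            out[B] = out.get(B, 0.0) - 2.0 * a
    return out

def rate(sigma):
    """Hypothetical strictly positive local rate c(sigma), for illustration only."""
    return math.exp(-0.3 * sigma[0] * (sigma[1] + sigma[-1]))

def lhs(f, sigma):
    """Left-hand side of (3.15), computed directly from the generator (3.1)."""
    def block_sum(s):  # equals |T| <f, L_N(s)>
        return sum(evaluate(f, shift(s, i)) for i in range(SIZE))
    base = block_sum(sigma)
    return sum(rate(shift(sigma, k)) * (math.exp(block_sum(flip(sigma, k)) - base) - 1.0)
               for k in range(SIZE)) / SIZE

def rhs(f, sigma):
    """Right-hand side of (3.15): <c (exp(D_N f) - 1), L_N(sigma)>."""
    Df = apply_D(f)
    return sum(rate(shift(sigma, k)) * (math.exp(evaluate(Df, shift(sigma, k))) - 1.0)
               for k in range(SIZE)) / SIZE

f = {frozenset({0}): 0.4, frozenset({0, 1}): -0.25, frozenset({0, 2, 3}): 0.1}
max_gap = max(abs(lhs(f, s) - rhs(f, s))
              for s in itertools.product((-1, 1), repeat=SIZE))
print("largest discrepancy over all configurations:", max_gap)
```

The check runs over all $2^5$ configurations, so it tests the key step (3.19) for every flip site $k$ simultaneously.
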



Remark: Note that, in the limit as $N \to \infty$, $\mathcal{D}_N$ becomes an unbounded operator $\mathcal{D}$, defined on local functions $f\colon \Omega \to \mathbb{R}$ via

$$ \mathcal{D} 1 = 0, \qquad \mathcal{D}\Big( \sum_A a_A H_A \Big) = \sum_A a_A \Big( \sum_{r\in -A} (-2)\, H_{A+r} \Big). \tag{3.21} $$

The domain of $\mathcal{D}$ can be extended to functions $f = \sum_A a_A H_A$ for which

$$ \sum_A \sum_{r\in -A} |a_A| < \infty. \tag{3.22} $$

The dual operator $\mathcal{D}^*$ acts on $\mathcal{M}(\Omega)$, the space of finite signed measures on $\Omega$, and since $\mathcal{D} 1 = 0$, $\mathcal{D}^*$ has the measures of total mass zero as image set. The intuitive idea is that when the dynamics starts from the empirical measure $\mu$, after an infinitesimal time $t$ the empirical measure is $\mu + t\,\mathcal{D}^*\mu + o(t)$.

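Remark: From Theorem 3.1 it follows that, for $f = \sum_{i=1}^n \lambda_i f_i$,

$$ L_N\Big( e^{|\mathbb{T}_N^d| \sum_{i=1}^n \lambda_i \langle f_i, \mathcal{L}_N(\cdot)\rangle} \Big)(\sigma) = |\mathbb{T}_N^d|\; e^{|\mathbb{T}_N^d| \sum_{i=1}^n \lambda_i \langle f_i, \mathcal{L}_N(\sigma)\rangle}\, \Big\langle c\,\Big( e^{\sum_{i=1}^n \lambda_i \mathcal{D}_N f_i} - 1 \Big), \mathcal{L}_N(\sigma) \Big\rangle. \tag{3.23} $$

The right-hand side is a function of $\mathcal{L}_N$. By taking derivatives with respect to the variables $\lambda_i$, we see that the generator maps any function of $\mathcal{L}_N$ into a function of $\mathcal{L}_N$. This shows that $(\mathcal{L}_N(\sigma^N(t)))_{t\geq 0}$ is a Markov process. Roughly speaking, this Markov process can be viewed as a random walk that makes jumps of size $1/|\mathbb{T}_N^d|$ at rate $|\mathbb{T}_N^d|$. Of course, the problem is that this random walk is infinite-dimensional, and therefore we cannot directly apply standard random-walk theory.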

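Theorem 3.1 shows that the operator $H$ defined by

$$ (HF)(\mu) = \lim_{N\to\infty} (H_N F)(\mathcal{L}_N(\sigma)) \quad \text{when } \lim_{N\to\infty} \mathcal{L}_N(\sigma) = \mu \text{ weakly} \tag{3.24} $$

is well-defined for $F(\mu) = \langle f, \mu \rangle$. We next extend Theorem 3.1 to $F$ of the form

$$ F(\mu) = \Psi\big( \langle f_1, \mu\rangle, \dots, \langle f_n, \mu\rangle \big), \tag{3.25} $$

where $\Psi\colon \mathbb{R}^n \to \mathbb{R}$ is $C^\infty$ with uniformly bounded derivatives of all orders.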

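Theorem 3.2. If $\lim_{N\to\infty} \mu_N = \mu$ and $F$ is of the form (3.25), then (with the same notation as in (3.12))

$$ (HF)(\mu) = \lim_{N\to\infty} (H_N F)(\mu_N) = \Big\langle c\,\Big( \exp\Big[ \sum_{i=1}^n \frac{\partial\Psi}{\partial x_i}\big( \langle f_1,\mu\rangle, \dots, \langle f_n,\mu\rangle \big)\, \mathcal{D} f_i \Big] - 1 \Big), \mu \Big\rangle. \tag{3.26} $$

Proof. Compute

$$ \frac{1}{|\mathbb{T}_N^d|}\, e^{-|\mathbb{T}_N^d|\, F(\mathcal{L}_N(\sigma))}\, L_N\big( e^{|\mathbb{T}_N^d|\, F\circ\mathcal{L}_N} \big)(\sigma) = \frac{1}{|\mathbb{T}_N^d|} \sum_{k\in\mathbb{T}_N^d} c(\tau_k\sigma) \Big( \exp\Big[ |\mathbb{T}_N^d| \big( F(\mathcal{L}_N(\sigma^k)) - F(\mathcal{L}_N(\sigma)) \big) \Big] - 1 \Big). \tag{3.27} $$

Next, use the fact that

$$ \langle f, \mathcal{L}_N(\sigma^k) \rangle - \langle f, \mathcal{L}_N(\sigma) \rangle = \frac{1}{|\mathbb{T}_N^d|}\, (\mathcal{D}_N f)(\tau_k(\sigma)) \tag{3.28} $$

to see that

$$ \begin{aligned} &\Psi\big( \langle f_1, \mathcal{L}_N(\sigma^k)\rangle, \dots, \langle f_n, \mathcal{L}_N(\sigma^k)\rangle \big) - \Psi\big( \langle f_1, \mathcal{L}_N(\sigma)\rangle, \dots, \langle f_n, \mathcal{L}_N(\sigma)\rangle \big) \\ &\qquad = \frac{1}{|\mathbb{T}_N^d|} \sum_{i=1}^n \frac{\partial\Psi}{\partial x_i}\big( \langle f_1, \mathcal{L}_N(\sigma)\rangle, \dots, \langle f_n, \mathcal{L}_N(\sigma)\rangle \big)\, (\mathcal{D}_N f_i)(\tau_k\sigma) + O\Big( \frac{1}{|\mathbb{T}_N^d|^2} \Big). \end{aligned} \tag{3.29} $$

Combine (3.27) and (3.29) and take the limit $N \to \infty$, to obtain (3.26).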

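□

Remark: For

$$ F(\mu) = \Psi\big( \langle f_1, \mu\rangle, \dots, \langle f_n, \mu\rangle \big) \tag{3.30} $$

the functional derivative of $F$ with respect to $\mu$ is defined as

$$ \frac{\delta F}{\delta\mu} = \sum_{i=1}^n \frac{\partial\Psi}{\partial x_i}\big( \langle f_1, \mu\rangle, \dots, \langle f_n, \mu\rangle \big)\, f_i. \tag{3.31} $$

We may therefore rewrite (3.26) as

$$ (HF)(\mu) = \Big\langle c\,\Big( \exp\Big[ \mathcal{D}\, \frac{\delta F}{\delta\mu} \Big] - 1 \Big), \mu \Big\rangle. \tag{3.32} $$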



4 The rate function for independent spin-flips 

4.1 Legendre transform 

Having completed the computation of the non-linear generator in Section 3, we are ready to compute its Legendre transform. As a warm-up, we will first do this for independent spin-flips, i.e., when $c \equiv 1$ in (3.2). In Section 5 we will extend the calculation to general $c$, which will not represent a serious obstacle.

The non-linear generator in (3.26) is of the form

$$ (HF)(\mu) = \mathcal{H}\Big( \mu, \frac{\delta F}{\delta\mu} \Big), \tag{4.1} $$

where, for $\mu \in \mathcal{M}_1(\Omega)$ and $f\colon \Omega \to \mathbb{R}$ continuous,

$$ \mathcal{H}(\mu, f) = \big\langle c\,\big( e^{\mathcal{D} f} - 1 \big), \mu \big\rangle. \tag{4.2} $$

By the convexity of $f \mapsto \mathcal{H}(\mu, f)$, we have

$$ \mathcal{H}(\mu, f) = \sup_{\alpha\in\mathcal{M}(\Omega)} \Big[ \int f \, d\alpha - L(\mu,\alpha) \Big] \tag{4.3} $$

with

$$ L(\mu,\alpha) = \sup_{f\in C(\Omega)} \Big[ \int f \, d\alpha - \mathcal{H}(\mu, f) \Big] \tag{4.4} $$

the Lagrangian appearing in the large deviation rate function in (3.11). As explained in Feng and Kurtz [11], Section 8.6.1.2, the representation of the generator in (4.1), where $\mathcal{H}(\mu,f)$ is a Legendre transform as in (4.3), implies that the generator in (4.1) generates a non-linear semigroup, called the Nisio control semigroup, associated with the function $L$ (see [11], Section 8.1).



Remark: The operator $\mathcal{D}$ has the property

$$ \mathcal{D} f_0 = -2 f_0, \qquad f_0(\sigma) = \sigma_0, \tag{4.5} $$

i.e., $f_0$ is an eigenfunction of $\mathcal{D}$. We recover the Hamiltonian in (2.7) (associated with the large deviation principle of the magnetization) from the infinite-dimensional Hamiltonian in (4.2) via the relation

$$ \mathcal{H}(\mu, p f_0) = H\big( \langle f_0, \mu\rangle, p \big). \tag{4.6} $$
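The relation (4.6) can be worked out in a few lines. The derivation below is a sketch under the assumption $c \equiv 1$ of this section, taking the infinite-dimensional Hamiltonian in the form $\langle e^{\mathcal{D} f} - 1, \mu\rangle$ obtained from (3.26) for linear $F$, and writing $m = \langle f_0, \mu\rangle$ for the magnetization under $\mu$, so that $\mu(\sigma_0 = \pm 1) = (1 \pm m)/2$.

```latex
\begin{align*}
\mathcal{H}(\mu, p f_0)
  &= \big\langle e^{\mathcal{D}(p f_0)} - 1, \mu \big\rangle
   = \big\langle e^{-2p\sigma_0} - 1, \mu \big\rangle
   && \text{by (4.5)} \\
  &= \mu(\sigma_0 = +1)\,\big(e^{-2p} - 1\big)
   + \mu(\sigma_0 = -1)\,\big(e^{2p} - 1\big) \\
  &= \frac{1+m}{2}\,\big(e^{-2p} - 1\big)
   + \frac{1-m}{2}\,\big(e^{2p} - 1\big)
   = H(m, p)
   && \text{by (2.7)}.
\end{align*}
```

Only the single-site marginal of $\mu$ enters, which is why the magnetization alone is Markovian for independent spin-flips.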
Remark: The infinite-dimensional Hamiltonian in (4.2) can be thought of as a function of the "position" variable $\mu$ and the "momentum" variable $f$. The corresponding Hamilton-Jacobi equations read

$$ \dot\mu = \frac{\delta\mathcal{H}}{\delta f}, \qquad \dot f = -\frac{\delta\mathcal{H}}{\delta\mu}. \tag{4.7} $$

These give a closed equation for $f$, because the Hamiltonian in (4.2) is linear in $\mu$. If we can solve the latter equation to find $f$, then we can integrate the equation for $\mu$ and find the solution for $\mu$. This is precisely the same situation - but now infinite-dimensional - as we encountered in (2.12), where the equation for $p$ was closed and could be integrated to give the solution for $m$.



4.2 Computation of the Lagrangian 



To find $L$, the function appearing in the rate function in (3.11), we have to compute the
Legendre transform in (4.4). To do so, we first consider the finite-dimensional analogue. We
start with rates $c = 1$, for which (4.4) becomes

$$L(\mu,\alpha) = \sup_{f} \Big[\sum_{i=1}^n f_i\alpha_i - \sum_{i=1}^n \big(e^{\sum_{j=1}^n D_{ij}f_j} - 1\big)\mu_i\Big], \qquad \mu = (\mu_1,\dots,\mu_n),\ \alpha = (\alpha_1,\dots,\alpha_n),\ f = (f_1,\dots,f_n), \tag{4.8}$$

where $\mu_i \in (0,\infty)$, $\sum_{i=1}^n \mu_i = 1$, $\alpha_i \in \mathbb{R}$, $f_i \in \mathbb{R}$, and $D_{ij} \in \mathbb{R}$. The matrix $D$ has the
additional property that $D(\mathbf{1}) = 0$, where $\mathbf{1}$ is the vector with all components equal to 1.
Hence $\sum_{i=1}^n (D^T\mu)_i = 0$, i.e., the transposed matrix $D^T$ maps any vector to a vector with
zero sum. For a vector $\alpha$, we say that $(D^T)^{-1}\alpha$ is well-defined if there exists a unique vector
$\nu = \nu(\alpha)$ with sum equal to 1 such that $D^T\nu = \alpha$. For two column vectors $\alpha,\beta \in \mathbb{R}^n$, let $\alpha\beta$
be the vector with components $\alpha_i\beta_i$, and $\alpha/\beta$ the vector with components $\alpha_i/\beta_i$. For $f\colon \mathbb{R} \to \mathbb{R}$,
write $f(\alpha)$ to denote the vector with components $f(\alpha_i)$. Then the equation for the maximizer
$f = f^*$ of (4.8) becomes

$$\alpha_k = \sum_{i=1}^n \mu_i\, e^{\sum_{j=1}^n D_{ij}f_j^*}\, D_{ik}, \qquad k = 1,\dots,n, \tag{4.9}$$

which in vector notation reads

$$\alpha = D^T\big(\mu\, e^{Df^*}\big). \tag{4.10}$$

If $(D^T)^{-1}\alpha$ is well-defined, then we can rewrite the latter equation as

$$Df^* = \log\Big(\frac{(D^T)^{-1}\alpha}{\mu}\Big), \tag{4.11}$$

and for this $f^*$ we have

$$\sum_{i=1}^n f_i^*\alpha_i = \langle f^*,\alpha\rangle = \Big\langle \log\Big(\frac{(D^T)^{-1}\alpha}{\mu}\Big), (D^T)^{-1}\alpha \Big\rangle \tag{4.12}$$

and

$$\sum_{i=1}^n \big(e^{\sum_{j=1}^n D_{ij}f_j^*} - 1\big)\mu_i = 0, \tag{4.13}$$

because the total mass of $\mu$ and $(D^T)^{-1}\alpha$ are both equal to 1. Hence, inserting (4.12) and
(4.13) into (4.8), we obtain the expression

$$L(\mu,\alpha) = \Big\langle \log\Big(\frac{(D^T)^{-1}\alpha}{\mu}\Big), (D^T)^{-1}\alpha \Big\rangle, \tag{4.14}$$

which is the relative entropy of $(D^T)^{-1}\alpha$ with respect to $\mu$. The intuition behind (4.14) is that
$L(\mu,\alpha)$ is the cost under the Markovian evolution for the initial measure to have derivative $\alpha$
at time zero.
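To make the expression (4.14) concrete, the following sketch evaluates it for a hypothetical two-state example with $D = \begin{pmatrix} -1 & 1 \\ 1 & -1 \end{pmatrix}$ (a single spin flipping at rate 1). The matrix, the function names and the numerical inputs are assumptions made for illustration. The code only evaluates the closed-form relative entropy in (4.14); it does not attempt to carry out the supremum in (4.8) numerically.

```python
import math

def inverse_dual(alpha):
    """Solve D^T nu = alpha with nu_1 + nu_2 = 1 for D = [[-1, 1], [1, -1]]."""
    a1, a2 = alpha
    if abs(a1 + a2) > 1e-12:
        raise ValueError("alpha must have total mass zero")
    # D^T nu = (-nu_1 + nu_2, nu_1 - nu_2), hence nu_1 - nu_2 = a2.
    return ((1 + a2) / 2, (1 - a2) / 2)

def lagrangian(mu, alpha):
    """Relative entropy of (D^T)^{-1} alpha with respect to mu, as in (4.14)."""
    nu = inverse_dual(alpha)
    if min(nu) <= 0:
        raise ValueError("(D^T)^{-1} alpha is not a strictly positive vector")
    return sum(n * math.log(n / m) for n, m in zip(nu, mu))

# The direction alpha = D^T mu is the one followed by the unperturbed dynamics.
print(lagrangian((0.7, 0.3), (-0.4, 0.4)))
# Any other admissible direction has strictly positive cost.
print(lagrangian((0.5, 0.5), (-0.5, 0.5)))
```

In this example the cost vanishes exactly when $(D^T)^{-1}\alpha = \mu$, i.e., when the prescribed derivative is the one produced by the Markovian evolution itself.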

Let us next consider the infinite-dimensional version of the above computation. First, for
$\alpha \in \mathcal{M}(\Omega)$ with total mass zero, we declare $(\mathcal{D}^*)^{-1}\alpha = \nu$ to be well-defined if there exists a
probability measure $\nu$ such that, for all $f$ in the domain of $\mathcal{D}$,

$$\langle \nu, \mathcal{D}f\rangle = \langle \alpha, f\rangle. \tag{4.15}$$

If $\alpha$ is translation-invariant, then also $(\mathcal{D}^*)^{-1}\alpha$ is translation-invariant. For
translation-invariant $\mu,\nu \in \mathcal{M}(\Omega)$, we denote by $s(\nu|\mu)$ the relative entropy density of $\nu$ with respect to
$\mu$, i.e.,

$$s(\nu|\mu) = \lim_{N\to\infty} \frac{1}{|\mathbb{T}_N^d|} \sum_{\sigma_{\mathbb{T}_N^d}} \nu(\sigma_{\mathbb{T}_N^d}) \log \frac{\nu(\sigma_{\mathbb{T}_N^d})}{\mu(\sigma_{\mathbb{T}_N^d})}. \tag{4.16}$$

Note that this limit does not necessarily exist. But if $\mu$ is a Gibbs measure, then for all
translation-invariant $\nu$ both $s(\nu|\mu)$ and $s(\nu|\mu_t)$ exist, where $\mu_t$ is $\mu$ evolved over time $t$ (see
van Enter, Fernández and Sokal [3], Le Ny and Redig [19]). The rate function which is the
analogue of (4.14) is now given by

$$L(\mu,\alpha) = s\big((\mathcal{D}^*)^{-1}\alpha \,\big|\, \mu\big) \tag{4.17}$$

with the same interpretation as for (4.14): $(\mathcal{D}^*)^{-1}\alpha$ produces derivative $\alpha$ at time zero for
the trajectory of the empirical measure, and its cost is the relative entropy density of this
measure with respect to the initial measure $\mu$.
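As a sanity check on the definition (4.16), the sketch below computes the per-site relative entropy of two product measures on a finite block of $n$ spins by brute-force summation over all $2^n$ configurations. The product measures (parametrized by their mean spin) and the function names are illustrative assumptions. For product measures the entropy is additive, so the per-site value does not depend on the block size and the limit in (4.16) exists trivially.

```python
import itertools
import math

def product_prob(config, x):
    """Probability of a +/-1 configuration under the product measure with mean spin x."""
    prob = 1.0
    for s in config:
        prob *= (1 + s * x) / 2
    return prob

def entropy_per_site(x, y, n):
    """(1/n) * sum over configurations of nu log(nu/mu), with nu = nu_x and mu = mu_y."""
    total = 0.0
    for config in itertools.product((-1, 1), repeat=n):
        p_nu = product_prob(config, x)
        p_mu = product_prob(config, y)
        total += p_nu * math.log(p_nu / p_mu)
    return total / n

# The per-site value is the same for every block size n.
for n in (1, 2, 4, 6):
    print(n, entropy_per_site(0.3, -0.2, n))
```

For non-product (e.g. Gibbs) measures the block marginals are not available in closed form, which is why existence of the limit is a genuine statement there.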

4.3 Optimal trajectories 

In order to gain some intuition about the rate function corresponding to the Lagrangian in
(4.17), we identify two easy optimal trajectories.

First, we consider a trajectory that starts from a product measure $\nu_{x_0}$ and ends at a
product measure $\nu_{x_t}$ with $x_t = x_0e^{-2t}$. The typical trajectory is then simply the
product-measure-valued trajectory $\gamma_t = \nu_{x_t}$ with $x_t = x_0e^{-2t}$. We can easily verify that this trajectory
has zero cost. Indeed, $\langle\gamma_t,H_A\rangle = x_t^{|A|}$, and hence $\langle\dot\gamma_t,H_A\rangle = |A|\,x_t^{|A|-1}\dot x_t$. On the other hand,
$\langle\mathcal{D}^*(\gamma_t),H_A\rangle = -2|A|\,x_t^{|A|}$ and, since $\dot x_t = -2x_t$, we thus see that $\langle\dot\gamma_t,H_A\rangle = \langle\mathcal{D}^*(\gamma_t),H_A\rangle$.
Therefore $(\mathcal{D}^*)^{-1}(\dot\gamma_t) = \gamma_t$, and (4.17) gives

$$L(\gamma_t,\dot\gamma_t) = s\big((\mathcal{D}^*)^{-1}(\dot\gamma_t)\,\big|\,\gamma_t\big) = s(\gamma_t|\gamma_t) = 0. \tag{4.18}$$

Note that this is the only product-measure-valued trajectory that has zero cost. Indeed, if
$\gamma_t = \nu_{x_t}$ has zero cost, then from the requirement that $\langle\dot\gamma_t,H_A\rangle = \langle\mathcal{D}^*(\gamma_t),H_A\rangle = -2|A|\,x_t^{|A|}$
we find that $\dot x_t = -2x_t$. For a general starting measure $\mu$, the trajectory that has zero
cost satisfies $\langle\dot\gamma_t,H_A\rangle = -2|A|\,\langle\gamma_t,H_A\rangle$, which has as solution $\langle\gamma_t,H_A\rangle = \langle\mu,H_A\rangle\,e^{-2|A|t}$,
corresponding to the Markovian independent spin-flip evolution started from $\mu$. Note that,
for a general trajectory $\gamma$, (4.15) with $f = H_A$ gives $\langle\dot\gamma_t,H_A\rangle = -2|A|\,\langle(\mathcal{D}^*)^{-1}(\dot\gamma_t),H_A\rangle$.
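The exponential decay $\langle\gamma_t,H_A\rangle = \langle\mu,H_A\rangle e^{-2|A|t}$ of the zero-cost trajectory can be checked on a small system. The sketch below assumes two spins flipping independently at rate 1, with $H_A(\sigma) = \prod_{i\in A}\sigma_i$, and integrates the forward equation $\dot\mu_t = \mathcal{D}^*\mu_t$ with a Runge-Kutta scheme. The system size, the starting measure (deliberately not a product measure) and all function names are hypothetical choices for illustration.

```python
import itertools
import math

SPINS = list(itertools.product((-1, 1), repeat=2))  # all configurations of two spins

def flip(config, i):
    """Configuration with the spin at site i reversed."""
    return tuple(-s if k == i else s for k, s in enumerate(config))

def forward_rhs(mu):
    """(D^* mu)(sigma) = sum_i [mu(sigma^i) - mu(sigma)] for rate-1 independent flips."""
    return {c: sum(mu[flip(c, i)] - mu[c] for i in range(len(c))) for c in SPINS}

def step(mu, h):
    """One fourth-order Runge-Kutta step of d mu/dt = D^* mu."""
    def shifted(base, direction, weight):
        return {c: base[c] + weight * direction[c] for c in SPINS}
    k1 = forward_rhs(mu)
    k2 = forward_rhs(shifted(mu, k1, h / 2))
    k3 = forward_rhs(shifted(mu, k2, h / 2))
    k4 = forward_rhs(shifted(mu, k3, h))
    return {c: mu[c] + h * (k1[c] + 2 * k2[c] + 2 * k3[c] + k4[c]) / 6 for c in SPINS}

def evolve(mu, t, steps=1000):
    for _ in range(steps):
        mu = step(mu, t / steps)
    return mu

def correlation(mu, A):
    """<mu, H_A> with H_A(sigma) = prod_{i in A} sigma_i."""
    return sum(p * math.prod(c[i] for i in A) for c, p in mu.items())

mu0 = {(-1, -1): 0.1, (-1, 1): 0.1, (1, -1): 0.2, (1, 1): 0.6}  # not a product measure
mu_t = evolve(mu0, 0.5)
for A in ((0,), (1,), (0, 1)):
    print(A, correlation(mu_t, A), correlation(mu0, A) * math.exp(-2 * len(A) * 0.5))
```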

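Second, we consider the case where $\mu = \mu_y$ is a product measure with

$$\langle\mu_y,H_A\rangle = y^{|A|}, \qquad -1 < y < 1, \tag{4.19}$$

and $\alpha = \alpha_x$ is the derivative at time zero of another product measure, i.e.,

$$\langle\alpha_x,H_A\rangle = -2|A|\,x^{|A|}, \qquad -1 < x < 1. \tag{4.20}$$

In that case $(\mathcal{D}^*)^{-1}\alpha_x = \nu_x$ with $\nu_x$ the translation-invariant product measure with $\langle\nu_x,H_A\rangle = x^{|A|}$.
The latter follows from the identity

$$\sum_{i\in A} \big[H_A(\sigma^i) - H_A(\sigma)\big] = -2|A|\,H_A(\sigma), \tag{4.21}$$

which integrates to $-2|A|\,x^{|A|}$ under $\nu_x$, and the rate function becomes

$$L(\mu_y,\alpha_x) = \frac{1+x}{2}\log\Big(\frac{1+x}{1+y}\Big) + \frac{1-x}{2}\log\Big(\frac{1-x}{1-y}\Big). \tag{4.22}$$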
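The two ingredients of this second example can be checked directly. The sketch below verifies the identity behind (4.21), namely $\sum_{i\in A}[H_A(\sigma^i) - H_A(\sigma)] = -2|A|\,H_A(\sigma)$ with $H_A(\sigma) = \prod_{i\in A}\sigma_i$, by enumerating all configurations of a small block, and evaluates the single-site relative entropy in (4.22). Block sizes, index sets and function names are illustrative assumptions.

```python
import itertools
import math

def spin_product(config, A):
    """H_A(sigma) = prod_{i in A} sigma_i."""
    return math.prod(config[i] for i in A)

def flip(config, i):
    """Configuration with the spin at site i reversed."""
    return tuple(-s if k == i else s for k, s in enumerate(config))

def check_identity(n, A):
    """Check sum_{i in A} [H_A(sigma^i) - H_A(sigma)] = -2|A| H_A(sigma) on {-1, 1}^n."""
    for config in itertools.product((-1, 1), repeat=n):
        lhs = sum(spin_product(flip(config, i), A) - spin_product(config, A) for i in A)
        if lhs != -2 * len(A) * spin_product(config, A):
            return False
    return True

def lagrangian_product(y, x):
    """Right-hand side of (4.22): relative entropy of nu_x with respect to mu_y per site."""
    return (0.5 * (1 + x) * math.log((1 + x) / (1 + y))
            + 0.5 * (1 - x) * math.log((1 - x) / (1 - y)))

print(check_identity(4, (0, 2)))
print(lagrangian_product(0.0, 0.5))
```

The cost in (4.22) vanishes only for $x = y$, in line with the first example: the unperturbed evolution started from $\mu_y$ has initial derivative $\alpha_y$.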



5 The rate function for dependent spin-flips 

5.1 Computation of the Lagrangian 

For general spin-flip rates $c$ in (3.2), let us return to the matrix calculation of Section 4.2.
Equation (4.8) has to be replaced by

$$L(\mu,\alpha) = \sup_{f} \Big[\sum_{i=1}^n f_i\alpha_i - \sum_{i=1}^n c_i\big(e^{\sum_{j=1}^n D_{ij}f_j} - 1\big)\mu_i\Big], \tag{5.1}$$

where $c_i > 0$, $i = 1,\dots,n$.

Put $C_\mu = \sum_{i=1}^n c_i\mu_i$. In the calculation with $c_i = 1$, $i = 1,\dots,n$,
this "total mass" does not depend on $\mu$ and is equal to 1. Now, however, it becomes a
normalization that depends on $\mu$. We say that $(D^T)^{-1}(\alpha,\mu)$ is well-defined if there exists a
non-negative vector $\nu = \nu(\alpha,\mu) = (\nu_1,\dots,\nu_n)$ with sum $C_\mu$ such that $D^T\nu = \alpha$. The analogue
of (4.14) reads

$$L(\mu,\alpha) = \Big\langle \log\Big[\frac{(D^T)^{-1}(\alpha,\mu)}{c\mu}\Big], (D^T)^{-1}(\alpha,\mu)\Big\rangle. \tag{5.2}$$

In order to find the analogue of this expression in the infinite-dimensional setting, we
proceed as follows. For two finite positive measures $\mu$, $\nu$ of equal total mass $M$, we define
$S(\nu|\mu)$ to be the relative entropy density of the probability measures $\nu/M$, $\mu/M$, i.e., $S(\nu|\mu) = s(\nu/M\,|\,\mu/M)$.
For $\mu \in \mathcal{M}_1(\Omega)$, we define the $c$-modification of $\mu$ as the positive measure $\mu_c$
defined via $\int_\Omega f(\sigma)\,d\mu_c(\sigma) = \int_\Omega c(\sigma)f(\sigma)\,d\mu(\sigma)$. For a signed measure $\alpha$ of total mass zero and
$\mu \in \mathcal{M}_1(\Omega)$, we say that $(\mathcal{D}^*)^{-1}(\alpha,\mu,c) = \nu$ is well-defined if there exists a positive measure
$\nu$ of total mass equal to that of $\mu_c$ such that $\mathcal{D}^*(\nu) = \alpha$. Then the analogue of (5.2) becomes



$$L(\mu,\alpha) = S\big((\mathcal{D}^*)^{-1}(\alpha,\mu,c)\,\big|\,\mu_c\big). \tag{5.3}$$

5.2 The non-linear semigroup and its relation with relative entropy

The non-linear semigroup with generator (3.12) is defined as follows. Let $\mathcal{P}^{\mathrm{inv}}(\Omega)$ be the set
of translation-invariant probability measures on $\Omega$. For local functions $f_1,\dots,f_n\colon \Omega \to \mathbb{R}$ and
a $C^\infty$-function $\Psi\colon \mathbb{R}^n \to \mathbb{R}$, we define an associated function $F_\Psi^{f_1,\dots,f_n}\colon \mathcal{P}^{\mathrm{inv}}(\Omega) \to \mathbb{R}$ via

$$F_\Psi^{f_1,\dots,f_n}(\mu) = \Psi\Big(\int f_1\,d\mu, \dots, \int f_n\,d\mu\Big). \tag{5.4}$$

Since $\langle f_i,\mathcal{L}_N\rangle$ is well-defined for $N$ large enough, we can define $F_\Psi^{f_1,\dots,f_n}(\mathcal{L}_N)$ for $N$ large
enough as well. This allows us to define a non-linear semigroup $(V(t))_{t\geq 0}$ via

$$\big(V(t)F_\Psi^{f_1,\dots,f_n}\big)(\mu) = \lim_{N\to\infty} \frac{1}{|\mathbb{T}_N^d|} \log \mathbb{E}_{\sigma^N}\Big(\exp\Big[|\mathbb{T}_N^d|\, F_\Psi^{f_1,\dots,f_n}\big(\mathcal{L}_N(\sigma^N(t))\big)\Big]\Big), \tag{5.5}$$

where $\mathbb{E}_{\sigma^N}$ denotes expectation with respect to the law of the process starting from $\sigma^N$, and
the limit is taken along a sequence of configurations $(\sigma^N)_{N\in\mathbb{N}}$ with $\sigma^N \in \Omega_N$ such that the
associated empirical measure $\mathcal{L}_N(\sigma^N)$ converges weakly to $\mu$ as $N \to \infty$. If $V(t)$ exists, then
it defines a non-linear semigroup, and the generator of $V(t)$ is given by (3.32).

Conversely, if $\mathcal{H}$ in (3.32) generates a semigroup, then this must be $(V(t))_{t\geq 0}$. The fact
that this semigroup is well-defined is sufficient to imply the large deviation principle for the
trajectory of the empirical measure (Feng and Kurtz [11], Theorem 5.15). Technically, the
difficulty consists in showing that the generator in (3.32) actually generates a semigroup.

We now make the link between the non-linear semigroup, its generator and some familiar 
objects of statistical mechanics, such as pressure and relative entropy density. 

Definition 5.1. The constrained pressure at time $t$ associated with a function $f\colon \Omega \to \mathbb{R}$ and
a Gibbs measure $\mu \in \mathcal{P}^{\mathrm{inv}}(\Omega)$ is defined as

$$p_t(f|\mu) = \lim_{N\to\infty} \frac{1}{|\mathbb{T}_N^d|} \log \mathbb{E}_{\sigma^N}\Big(e^{\sum_{x\in\mathbb{T}_N^d} \tau_x f(\sigma(t))}\Big), \tag{5.6}$$

where the limit is taken along a sequence of configurations $(\sigma^N)_{N\in\mathbb{N}}$ with $\sigma^N \in \Omega_N$ such that
the associated empirical measure $\mathcal{L}_N(\sigma^N)$ converges weakly to $\mu$ as $N \to \infty$.

In particular, $p_0(f|\mu) = \int_\Omega f\,d\mu$. The relation between the non-linear semigroup in (5.5)
and the constrained pressure at time $t$ reads

$$\big(V(t)\langle f,\cdot\rangle\big)(\mu) = p_t(f|\mu). \tag{5.7}$$

The pressure at time $t$ is defined as

$$p(f|\mu_t) = \lim_{N\to\infty} \frac{1}{|\mathbb{T}_N^d|} \log \mathbb{E}_{\mu}\Big(e^{\sum_{x\in\mathbb{T}_N^d} \tau_x f(\sigma(t))}\Big). \tag{5.8}$$

This is well-defined as soon as the dynamics starts from a Gibbs measure $\mu_0 = \mu$ (see Le Ny
and Redig [19]). The relation between the pressure and the constrained pressure reads

$$p(f|\mu_t) = \sup_{\nu} \big[p_t(f|\nu) - s(\nu|\mu)\big]. \tag{5.9}$$

On the other hand, the pressure at time $t$ is the Legendre transform of the relative entropy
density with respect to $\mu_t$, i.e.,

$$p(f|\mu_t) = \sup_{\nu} \Big[\int f\,d\nu - s(\nu|\mu_t)\Big], \tag{5.10}$$

where the relative entropy density $s(\nu|\mu_t)$ exists because $\mu_t$ is asymptotically decoupled (see
Pfister [21]) as soon as $\mu_0 = \mu$ is a Gibbs measure (see Le Ny and Redig [19]).
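The Legendre duality between pressure and relative entropy in (5.10) has an elementary single-site analogue that can be checked numerically. The sketch below is a toy illustration only: it replaces $\mu_t$ by a fixed, hypothetical two-point distribution `MU` on $\{-1,+1\}$, for which the pressure is $\log\sum_s \mu(s)e^{f(s)}$, and compares it with the supremum of $\langle f,\nu\rangle - s(\nu|\mu)$ over a grid of probability vectors $\nu$. All names and numbers are assumptions.

```python
import math

MU = {-1: 0.3, 1: 0.7}  # hypothetical single-site reference measure

def pressure(f):
    """log of the mu-expectation of exp(f): the single-site pressure."""
    return math.log(sum(MU[s] * math.exp(f[s]) for s in MU))

def relative_entropy(nu):
    """sum_s nu(s) log(nu(s) / mu(s)), with the convention 0 log 0 = 0."""
    return sum(nu[s] * math.log(nu[s] / MU[s]) for s in nu if nu[s] > 0)

def variational_value(f, q):
    """<f, nu> - s(nu | mu) for nu = (1 - q, q) on {-1, +1}."""
    nu = {-1: 1 - q, 1: q}
    return sum(f[s] * nu[s] for s in nu) - relative_entropy(nu)

def sup_over_nu(f, grid=100001):
    """Grid approximation of the supremum over probability vectors nu."""
    return max(variational_value(f, k / (grid - 1)) for k in range(grid))

f = {-1: 0.4, 1: -1.1}
print(pressure(f), sup_over_nu(f))
```

The supremum is attained at the tilted measure $\nu^*(s) \propto \mu(s)e^{f(s)}$, which is the single-site counterpart of the maximizer in (5.10).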

The relation between the non-linear generator and the constrained pressure is now as
follows. Define the Legendre transform of the constrained pressure as

$$p_t^*(\nu|\mu) = \sup_{f\in C(\Omega)} \Big[\int f\,d\nu - p_t(f|\mu)\Big]. \tag{5.11}$$

Then the relation with the Hamiltonian in (4.2) and the Lagrangian in (5.3) is

$$\mathcal{H}(\mu,f) = \frac{d}{dt}\,p_t(f|\mu)\Big|_{t=0} \tag{5.12}$$

and

$$L(\mu,\alpha) = \lim_{t\downarrow 0} \frac{1}{t}\,p_t^*(\mu + t\alpha\,|\,\mu). \tag{5.13}$$
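Relation (5.12) can be checked by hand in the simplest case. The sketch below assumes independent rate-1 spin flips and the single-site function $f = h\sigma_0$. Under these assumptions each spin satisfies $\mathbb{E}[e^{h\sigma_x(t)} \mid \sigma_x(0)] = \cosh h + \sigma_x(0)e^{-2t}\sinh h$, so the constrained pressure depends on $\mu$ only through the magnetization $m = \langle\sigma_0,\mu\rangle$. Its time derivative at $t = 0$ is then compared with $\int (e^{\mathcal{D}f}-1)\,d\mu = \frac{1-m}{2}(e^{2h}-1) + \frac{1+m}{2}(e^{-2h}-1)$. The function names and the finite-difference step are illustrative choices.

```python
import math

def constrained_pressure(t, h, m):
    """p_t(h * sigma_0 | mu) for independent rate-1 flips; depends on mu through m only."""
    decay = math.exp(-2 * t)
    plus = math.log(math.cosh(h) + decay * math.sinh(h))   # sites with sigma_x(0) = +1
    minus = math.log(math.cosh(h) - decay * math.sinh(h))  # sites with sigma_x(0) = -1
    return 0.5 * (1 + m) * plus + 0.5 * (1 - m) * minus

def hamiltonian(m, h):
    """Integral of (exp(D f) - 1) against mu for f = h * sigma_0 and rates c = 1."""
    return 0.5 * (1 - m) * (math.exp(2 * h) - 1) + 0.5 * (1 + m) * (math.exp(-2 * h) - 1)

def time_derivative_at_zero(h, m, eps=1e-6):
    """Forward finite difference of t -> p_t at t = 0."""
    return (constrained_pressure(eps, h, m) - constrained_pressure(0.0, h, m)) / eps

h, m = 0.7, 0.3
print(constrained_pressure(0.0, h, m), h * m)          # p_0(f | mu) equals the integral of f
print(time_derivative_at_zero(h, m), hamiltonian(m, h))
```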



Remark: The operator $\mathcal{D}$, acting on the space $C(\Omega)$ of continuous functions on $\Omega$, has a dual
operator $\mathcal{D}^*$, acting on the space $\mathcal{M}(\Omega)$ of finite signed measures on $\Omega$, defined via

$$\langle f,\mathcal{D}^*\mu\rangle = \langle\mathcal{D}f,\mu\rangle. \tag{5.14}$$

In order to gain some understanding for $\mathcal{D}^*$ (which will be useful later on), we first compute
$\mathcal{D}^*\mu$ for a Gibbs measure $\mu \in \mathcal{P}^{\mathrm{inv}}(\Omega)$. Without loss of generality we may assume that the
interaction potential of $\mu$ is a sum of terms of the form $\Phi(A,\sigma) = J_A H(A,\sigma)$, $A \subset \mathbb{Z}^d$
finite, where $J_A$ is translation invariant, i.e., $J_{A+k} = J_A$, $k \in \mathbb{Z}^d$. We also assume absolute
summability, i.e.,

$$\sum_{A\ni 0} |J_A| < \infty. \tag{5.15}$$

Remember that

$$(\mathcal{D}f)(\sigma) = \sum_i \big[f(\sigma^i) - f(\sigma)\big]. \tag{5.16}$$

Therefore, for the Gibbs measure $\mu$ under consideration, we have

$$\langle\mu,\mathcal{D}f\rangle = \int \sum_i \Big(\frac{d\mu^i}{d\mu} - 1\Big) f\,d\mu, \tag{5.17}$$

where $\mu^i$ denotes the distribution of $\sigma^i$ when $\sigma$ is distributed according to $\mu$. Note that the
sum in the right-hand side of (5.17) is formal, i.e., the integral is well-defined due to the
multiplication with the local function $f$. In terms of $J_A$, $A \subset \mathbb{Z}^d$ finite, we have

$$\sum_i \Big(\frac{d\mu^i}{d\mu} - 1\Big)(\sigma) = \sum_i \Big(e^{-2\sum_{A\ni i} J_A H(A,\sigma)} - 1\Big), \tag{5.18}$$



where, once again, this expression is well-defined only after multiplication with a local function
and integration over $\mu$.

6 Bad empirical measures

In Section 7 we will see what consequences the large deviation principle for the trajectory
of the empirical measure derived in Sections 3 and 5 has for the question of Gibbs versus
non-Gibbs. This needs the notion of bad empirical measure, which we define next.

If we start our spin-flip dynamics from a Gibbs measure $\mu \in \mathcal{P}^{\mathrm{inv}}(\Omega)$, then a
probability-measure-valued trajectory $\gamma = (\gamma_t)_{t\in[0,T]}$ has cost

$$I_\mu(\gamma) = s(\gamma_0|\mu) + \int_0^T L(\gamma_t,\dot\gamma_t)\,dt, \tag{6.1}$$

where the term $s(\gamma_0|\mu)$ is the cost of the initial distribution $\gamma_0$. We are interested in the
minimizers of $I_\mu(\gamma)$ over the set of trajectories $\gamma$ that end at a given measure $\nu$. Let

$$K_T(\mu',\nu) = \inf_{\gamma\colon\, \gamma_0 = \mu',\ \gamma_T = \nu} \int_0^T L(\gamma_t,\dot\gamma_t)\,dt. \tag{6.2}$$

Then $e^{-|\mathbb{T}_N^d|\,K_T(\mu',\nu)}$ can be thought of as the transition probability for the empirical measure
$\mathcal{L}_N$ to go from $\mu'$ to $\nu$, up to factors of order $e^{o(|\mathbb{T}_N^d|)}$. Hence

$$-\lim_{N\to\infty} \frac{1}{|\mathbb{T}_N^d|} \log \mathbb{P}_\mu\big(\mathcal{L}_N(\sigma_0) = \mu' \,\big|\, \mathcal{L}_N(\sigma_T) = \nu\big) = \big[s(\mu'|\mu) + K_T(\mu',\nu)\big] - \inf_{\tilde\mu}\big[s(\tilde\mu|\mu) + K_T(\tilde\mu,\nu)\big]. \tag{6.3}$$

Let $\mathcal{M}^*(\mu,\nu)$ be the set of probability measures $\mu'$ for which the infimum in the right-hand
side of (6.3) is attained. We can then think of each element in this set as a typical empirical
measure at time $t = 0$ given that the empirical measure at time $T$ is $\nu$. When $\mathcal{M}^*$ is a
singleton, we denote its unique element by $\mu^*(\mu,\nu)$.

Definition 6.1. (a) A measure $\nu$ is called bad at time $t$ if $\mathcal{M}^*(\mu,\nu)$ (computed with $T = t$) contains at least two
elements $\mu_1$ and $\mu_2$ and there exist two sequences $(\nu_n^1)_{n\in\mathbb{N}}$ and $(\nu_n^2)_{n\in\mathbb{N}}$, both converging to $\nu$
as $n \to \infty$, such that $\mu^*(\mu,\nu_n^1)$ converges to $\mu_1$ and $\mu^*(\mu,\nu_n^2)$ converges to $\mu_2$.
(b) A measure $\nu$ that is bad at time $t$ has at least two possible histories, stated as a two-layer
property: seeing the measure $\nu$ at time $t$ is compatible (in the sense of optimal trajectories)
with two different measures at time $t = 0$.

Badness of a measure can be detected as follows.

Proposition 6.2. A measure $\nu$ is bad at time $t$ if there exist a local function $f\colon \Omega \to \mathbb{R}$, two
sequences $(\nu_n^1)_{n\in\mathbb{N}}$ and $(\nu_n^2)_{n\in\mathbb{N}}$ both converging to $\nu$, and an $\epsilon > 0$ such that

$$\Big|\mathbb{E}\big(f(\sigma(0)) \,\big|\, \mathcal{L}_N(\sigma(t)) = (\nu_n^1)_N\big) - \mathbb{E}\big(f(\sigma(0)) \,\big|\, \mathcal{L}_N(\sigma(t)) = (\nu_n^2)_N\big)\Big| > \epsilon \qquad \forall\, N,n \in \mathbb{N}, \tag{6.4}$$

where $(\nu_n)_N$ denotes the projection of $\nu_n$ on $\mathbb{T}_N^d$.

7 A large deviation view on dynamical Gibbs-non-Gibbs transitions

In van Enter, Fernández, den Hollander and Redig [5] we studied the evolution of a Gibbs
measure $\mu$ under a high-temperature spin-flip dynamics. We showed that the Gibbsianness of
the measure $\mu_t$ at time $t > 0$ is equivalent to the absence of a phase transition in the
double-layer system. More precisely, conditioned on end configuration $\eta$ at time $t$, the distribution
at time $t = 0$ is a Gibbs measure $\mu^\eta$ with $\eta$-dependent formal Hamiltonian

$$H_t^\eta(\sigma) = H(\sigma) + h_t \sum_i \sigma_i\eta_i, \tag{7.1}$$

where $t \mapsto h_t$ is a monotone function with $\lim_{t\downarrow 0} h_t = \infty$ and $\lim_{t\to\infty} h_t = 0$. If the
double-layer system has a phase transition for an end configuration $\eta$, then $\eta$ is called bad. In that
case $\eta$ is an essential point of discontinuity for any version of the conditional probability
$\mu_t(\sigma_\Lambda = \cdot \mid \sigma_{\Lambda^c})$, $\Lambda \subset \mathbb{Z}^d$ finite.

The relation between the double-layer system and the trajectory of the empirical distribution
is as follows. Suppose that the double-layer system has no phase transition for any end
configuration $\eta$. If we condition the empirical measure at time $t > 0$ to be $\nu$, then (by further
conditioning on the configuration $\eta$ at time $t > 0$) we conclude that at time $t = 0$ we have the
measure $\int \mu^\eta\,\nu(d\eta)$. Hence the optimizing trajectory is unique. Conversely, if there exists a
bad configuration $\eta$, then (because of the translation invariance of the initial measure and of
the dynamics) all translates of $\eta$ are bad also. Hence we expect that a translation-invariant
measure $\nu$ arising as any weak limit point of $|\mathbb{T}_N^d|^{-1}\sum_{x\in\mathbb{T}_N^d} \delta_{\tau_x\eta}$ is bad also.

As an example, let us consider a situation studied in [5]. The dynamics starts from $\mu^+$, the
low-temperature plus-phase of the Ising model with zero magnetic field, and evolves according
to independent spin-flips. Then, from some time onwards, the alternating configuration
$\eta_{\mathrm{alt}}(x) = (-1)^{\sum_{i=1}^d |x_i|}$ becomes bad. The same is true for $-\eta_{\mathrm{alt}}$, and so the
translation-invariant measure

$$\nu = \tfrac12\big(\delta_{\eta_{\mathrm{alt}}} + \delta_{-\eta_{\mathrm{alt}}}\big) \tag{7.2}$$

has the property that, for $\nu$-a.e. configuration $\eta$, the double-layer system has a phase transition
when the end configuration is $\eta$. Moreover, the Hamiltonian $H_t^\eta$ has a plus-phase $\mu_\eta^+$ and a
minus-phase $\mu_\eta^-$. Therefore, when we condition on the empirical measure in (7.2) we get two
possible optimal trajectories, one starting at $\tfrac12(\mu_{\eta_{\mathrm{alt}}}^+ + \mu_{-\eta_{\mathrm{alt}}}^+)$ and one starting at $\tfrac12(\mu_{\eta_{\mathrm{alt}}}^- + \mu_{-\eta_{\mathrm{alt}}}^-)$. To
realize the approximating measures of Proposition 6.2 we choose $\nu_n^+$, $\nu_n^-$ to be the randomized
versions of $\nu$ where we first choose a configuration according to $\nu$ and then independently flip
spins with probability $1/n$, to change either from minus to plus or stay plus if it was plus to
begin with, respectively, to change to minus or stay minus. Clearly, by the FKG-inequality,
when conditioning on $\nu_n^+$, respectively, $\nu_n^-$ as empirical distribution, we get a measure at time
$t = 0$ that is above $\tfrac12(\mu_{\eta_{\mathrm{alt}}}^+ + \mu_{-\eta_{\mathrm{alt}}}^+)$, respectively, below $\tfrac12(\mu_{\eta_{\mathrm{alt}}}^- + \mu_{-\eta_{\mathrm{alt}}}^-)$. Hence (6.4) holds with $f(\sigma) = \sigma_0$,
and $\nu$ is bad.

A A simple example of the Feng-Kurtz formalism

A.1 Poisson walk with small increments

In order to introduce the general formalism developed in Feng and Kurtz [11], let us consider
a simple example where computations are simple yet the fundamental objects of the general
theory already appear naturally.

Let $X_N = (X_N(t))_{t\geq 0}$ be the continuous-time random walk on $\mathbb{R}$ that jumps $N^{-1}$ forward
at rate $bN$ and $N^{-1}$ backward at rate $dN$, with $b,d \in (0,\infty)$. This is the Markov process
with generator

$$(L_Nf)(x) = bN\big[f(x + N^{-1}) - f(x)\big] + dN\big[f(x - N^{-1}) - f(x)\big]. \tag{A.1}$$

Clearly, if $\lim_{N\to\infty} X_N(0) = x \in \mathbb{R}$, then

$$\lim_{N\to\infty} X_N(t) = x + (b-d)t, \qquad t \geq 0, \tag{A.2}$$

i.e., in the limit as $N \to \infty$ the random process $X_N$ becomes a deterministic process $(x(t))_{t\geq 0}$
solving the limiting equation

$$\dot x = (b-d), \qquad x(0) = x. \tag{A.3}$$

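For all $N \in \mathbb{N}$, when $X_N(0) = 0$ we have

$$X_N(t) = N^{-1}\big[\mathcal{N}^+(Nbt) - \mathcal{N}^-(Ndt)\big] = N^{-1}\sum_{i=1}^N \big(X_i^t - Y_i^t\big) \tag{A.4}$$

with $\mathcal{N}^+ = (\mathcal{N}^+(t))_{t\geq 0}$ and $\mathcal{N}^- = (\mathcal{N}^-(t))_{t\geq 0}$ independent rate-1 Poisson processes, and
$X_i^t$, $Y_i^t$, $i = 1,\dots,N$, independent Poisson random variables with mean $bt$, respectively, $dt$
(the second equality holds in distribution).
Consequently, we can use Cramér's theorem for sums of i.i.d. random variables to compute

$$I(at) = -\lim_{N\to\infty} \frac1N \log \mathbb{P}_N\big(X_N(t) = at \mid X_N(0) = 0\big) = \sup_{\lambda\in\mathbb{R}} \big[at\lambda - F(\lambda)\big], \tag{A.5}$$

where

$$F(\lambda) = \lim_{N\to\infty} \frac1N \log \mathbb{E}_N\big(e^{\lambda N X_N(t)}\big) = t\big[b(e^\lambda - 1) + d(e^{-\lambda} - 1)\big]. \tag{A.6}$$

Thus, we see that

$$I(at) = tL(a) \tag{A.7}$$

with

$$L(a) = \sup_{\lambda\in\mathbb{R}} \big[a\lambda - b(e^\lambda - 1) - d(e^{-\lambda} - 1)\big]. \tag{A.8}$$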
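The supremum in (A.8) can be computed in closed form: setting the derivative of $\lambda \mapsto a\lambda - b(e^\lambda - 1) - d(e^{-\lambda} - 1)$ to zero gives $be^{2\lambda} - ae^{\lambda} - d = 0$, hence $e^{\lambda^*} = \big(a + \sqrt{a^2 + 4bd}\big)/(2b)$. The sketch below evaluates this and compares it with a brute-force grid search. The parameter values, the grid bounds and the function names are illustrative assumptions.

```python
import math

def hamiltonian(lam, b, d):
    """b (e^lam - 1) + d (e^-lam - 1): the function subtracted inside the supremum of (A.8)."""
    return b * (math.exp(lam) - 1) + d * (math.exp(-lam) - 1)

def lagrangian(a, b, d):
    """Closed form of (A.8), using the positive root of b z^2 - a z - d = 0 for z = e^lam."""
    lam = math.log((a + math.sqrt(a * a + 4 * b * d)) / (2 * b))
    return a * lam - hamiltonian(lam, b, d)

def lagrangian_grid(a, b, d, lo=-10.0, hi=10.0, n=200001):
    """Brute-force check: maximize a * lam - H(lam) over an equally spaced grid."""
    width = (hi - lo) / (n - 1)
    return max(a * (lo + k * width) - hamiltonian(lo + k * width, b, d) for k in range(n))

b, d = 2.0, 1.0
print(lagrangian(b - d, b, d))                        # the typical velocity b - d costs nothing
print(lagrangian(3.0, b, d), lagrangian_grid(3.0, b, d))
```

The cost vanishes exactly at the typical velocity $a = b - d$ from (A.3) and is strictly positive elsewhere, as a rate function should be.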



Using the property that the increments of the Poisson process are independent over disjoint
time intervals, we can now compute

$$\begin{aligned}
\lim_{N\to\infty} \frac1N \log \mathbb{P}_N\big((X_N(t))_{t\in[0,T]} \approx (\gamma_t)_{t\in[0,T]}\big)
&= \lim_{n\to\infty} \sum_{i=1}^n \lim_{N\to\infty} \frac1N \log \mathbb{P}_N\big(X_N(t_i) - X_N(t_{i-1}) \approx \dot\gamma_{t_{i-1}}(t_i - t_{i-1})\big) \\
&= -\lim_{n\to\infty} \sum_{i=1}^n (t_i - t_{i-1})\,L(\dot\gamma_{t_{i-1}}) = -\int_0^T L(\dot\gamma_t)\,dt,
\end{aligned} \tag{A.9}$$

where $L$ is given by (A.8) and $t_i$, $i = 1,\dots,n$, is a partition of the time interval $[0,T]$ that
becomes dense in the limit as $n \to \infty$.

We see from the above elementary computation that, in the limit as $N \to \infty$,

$$\mathbb{P}_N\big((X_N(t))_{t\in[0,T]} \approx (\gamma_t)_{t\in[0,T]}\big) \approx \exp\Big[-N\int_0^T L(\gamma_t,\dot\gamma_t)\,dt\Big], \tag{A.10}$$

where the Lagrangian $L$ only depends on the second variable, namely,

$$L(\gamma_t,\dot\gamma_t) = L(\dot\gamma_t) \tag{A.11}$$

with $L$ given by (A.8). We interpret (A.10) as follows: if the trajectory is not differentiable
at some time $t \in [0,T]$, then the probability in the left-hand side of (A.10) decays
superexponentially fast with $N$, i.e.,

$$\lim_{N\to\infty} \frac1N \log \mathbb{P}_N\big((X_N(t))_{t\in[0,T]} \approx (\gamma_t)_{t\in[0,T]}\big) = -\infty, \tag{A.12}$$

and otherwise it is given by the formula in (A.10) (read in the standard large-deviation
language).

The Lagrangian in (A.8) is the Legendre transform of the Hamiltonian

$$H(\lambda) = b(e^\lambda - 1) + d(e^{-\lambda} - 1). \tag{A.13}$$

This Hamiltonian can be obtained from the generator in (A.1) as follows:

$$H(\lambda) = \lim_{N\to\infty} \frac1N\, e^{-Nf_\lambda(x)}\big(L_N e^{Nf_\lambda}\big)(x), \qquad f_\lambda(x) = \lambda x. \tag{A.14}$$

More generally, by considering the operator

$$(\mathcal{H}f)(x) = \lim_{N\to\infty} \frac1N\, e^{-Nf(x)}\big(L_N e^{Nf}\big)(x) = b\big(e^{f'(x)} - 1\big) + d\big(e^{-f'(x)} - 1\big), \tag{A.15}$$

we see that the Hamiltonian equals

$$H(\lambda) = (\mathcal{H}f_\lambda)(x), \tag{A.16}$$

and that, by the convexity of $\lambda \mapsto H(\lambda)$,

$$(\mathcal{H}f)(x) = H(f'(x)) = \sup_{a\in\mathbb{R}} \big[af'(x) - L(a)\big]. \tag{A.17}$$

The operator $\mathcal{H}$ is called the generator of the non-linear semigroup.
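The limit defining the non-linear generator can be observed numerically. For the walk with generator (A.1), the prelimit expression $\frac1N e^{-Nf(x)}(L_Ne^{Nf})(x)$ equals $b\big[e^{N(f(x+1/N)-f(x))} - 1\big] + d\big[e^{N(f(x-1/N)-f(x))} - 1\big]$, which the sketch below compares with $b(e^{f'(x)}-1) + d(e^{-f'(x)}-1)$ for the test function $f = \sin$. The test function, the evaluation point and the parameter values are illustrative assumptions.

```python
import math

def nonlinear_generator(f, x, n, b, d):
    """(1/N) exp(-N f(x)) (L_N exp(N f))(x) for the generator L_N in (A.1)."""
    up = math.exp(n * (f(x + 1 / n) - f(x)))    # forward jump of size 1/N
    down = math.exp(n * (f(x - 1 / n) - f(x)))  # backward jump of size 1/N
    return b * (up - 1) + d * (down - 1)

def limiting_generator(df, x, b, d):
    """b (exp(f'(x)) - 1) + d (exp(-f'(x)) - 1), where df is the derivative of f."""
    return b * (math.exp(df(x)) - 1) + d * (math.exp(-df(x)) - 1)

b, d, x = 2.0, 1.0, 0.3
limit = limiting_generator(math.cos, x, b, d)
for n in (10, 1000, 100000):
    print(n, abs(nonlinear_generator(math.sin, x, n, b, d) - limit))
```

The discrepancy shrinks roughly like $1/N$, reflecting the second-order term in the Taylor expansion of $f(x \pm 1/N)$.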

A.2 The scheme of Feng and Kurtz

The scheme that produces the Lagrangian in (A.8) from the operator in (A.15) actually works
in much greater generality. Consider a sequence of Markov processes $X = (X_N)_{N\in\mathbb{N}}$ with
$X_N = (X_N(t))_{t\geq 0}$, living on a common state space (like $\mathbb{R}$, $\mathbb{R}^d$ or a space of probability
measures). Suppose that $X_N$ has generator $L_N$ and in the limit as $N \to \infty$ converges to a
process $(x(t))_{t\geq 0}$, which can be either deterministic (as in the previous example) or stochastic.
We want to identify the Lagrangian controlling the large deviations of the trajectories:

$$\mathbb{P}_N\big((X_N(t))_{t\in[0,T]} \approx (\gamma_t)_{t\in[0,T]}\big) \approx \exp\Big[-N\int_0^T L(\gamma_t,\dot\gamma_t)\,dt\Big]. \tag{A.18}$$

Omitting technical conditions, we see that this can be done in four steps:

1. Compute the generator of the non-linear semigroup

$$(\mathcal{H}f)(x) = \lim_{N\to\infty} \frac1N\, e^{-Nf(x)}\big(L_N e^{Nf}\big)(x). \tag{A.19}$$

2. Look for a function $H(x,p)$ of two variables such that

$$(\mathcal{H}f)(x) = H(x,\nabla f(x)). \tag{A.20}$$

What $\nabla f$ means depends on the context: on $\mathbb{R}^d$ it simply is the gradient of $f$, while on
an infinite-dimensional state space it is a functional derivative.

3. Express the function $H$ as a Legendre transform:

$$H(x,p) = \sup_{\lambda} \big[\langle p,\lambda\rangle - L(x,\lambda)\big]. \tag{A.21}$$

What $\langle\cdot,\cdot\rangle$ means also depends on the context: on $\mathbb{R}^d$ it simply is the inner product, while
in general it is a natural pairing between a space and its dual, such as $\langle f,\mu\rangle = \int f\,d\mu$.

4. The Lagrangian in (A.18) is the function $L$ with $x = \gamma_t$ and $\lambda = \dot\gamma_t$.
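The last two steps of the scheme can be sketched numerically in one dimension. Assuming a Hamiltonian $H(x,p)$ that is convex in $p$ is already available from steps 1 and 2, the code below recovers $L(x,\lambda) = \sup_p[p\lambda - H(x,p)]$ by ternary search and approximates the action $\int_0^T L(\gamma_t,\dot\gamma_t)\,dt$ of a given path by a midpoint Riemann sum. It is illustrated with the Poisson-walk Hamiltonian of Section A.1. The search interval, the step counts and all function names are illustrative assumptions, not part of the Feng-Kurtz theory.

```python
import math

def legendre(hamiltonian, x, lam, lo=-20.0, hi=20.0, iters=200):
    """L(x, lam) = sup_p [p * lam - H(x, p)] via ternary search (needs H convex in p)."""
    def objective(p):
        return p * lam - hamiltonian(x, p)
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if objective(m1) < objective(m2):
            lo = m1
        else:
            hi = m2
    return objective((lo + hi) / 2)

def action(hamiltonian, path, t_end, steps=200):
    """Midpoint Riemann sum of L(gamma_t, d gamma_t / dt) over [0, t_end]."""
    h = t_end / steps
    total = 0.0
    for k in range(steps):
        t0, t1 = k * h, (k + 1) * h
        velocity = (path(t1) - path(t0)) / h  # finite-difference derivative of the path
        midpoint = path((t0 + t1) / 2)
        total += h * legendre(hamiltonian, midpoint, velocity)
    return total

def poisson_walk_hamiltonian(x, p, b=2.0, d=1.0):
    """H(x, p) = b (e^p - 1) + d (e^-p - 1); independent of x for the walk of Section A.1."""
    return b * (math.exp(p) - 1) + d * (math.exp(-p) - 1)

print(action(poisson_walk_hamiltonian, lambda t: 1.0 * t, 1.0))  # typical path, velocity b - d
print(action(poisson_walk_hamiltonian, lambda t: 3.0 * t, 1.0))  # atypical path, positive cost
```

For a state-dependent Hamiltonian the same two functions apply unchanged, since `legendre` is evaluated at the current position of the path.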